FIGURE l 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTCTGAAGACTAACAT T T TGTGAAGT TGTAAAACAGAAAACCTGT T AGAAATGTGG T GGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
TCATACAT TACTGCAGTAACACT CCACCATATAGACCCGGCT TTACCTTAT ATCAGTGACACTGG 
TACAGTAGCTCCAGAAAAATGCT TATT TGGGGCAATGCTAAATATTGCGGCAGT T T TATGCATT G 
CTACCAT T TAT GTT CGT TATAAGCAAGT T CATGC T C TGAGTCC TGAAGAGAACG TTATCAT CAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
H CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
M T AT ATATG T T TGT T CAGACCAT CC T T T CCTACCAAATGCAGCC CAAAAT CC AT GGCAAACAAG T C 

%l: TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 

yj atcagttttgcacagtggcaattttgggactgatttagaacagaaactcc^^ 

U1 acaaaggttatgtgcttcacatgatcactactgcagc^gaatggtctatgtcattttcct 

ggttttttcctgacttacattcgtgattttcagaaaatttctttacgggtggaagccaatttaca 
tggattaaccctctatgacactgcaccttgccctattaacaatgaacgaacacggctactttcca 
gagatatt tgat gaaaggataaaatatttctgtaatgattatgattctcagggattggggaaagg 
iji ttcacagaagttgcttattcttctctgaa^ttttcaaccacttaatcaa.ggctgacagtaacact 
Q gatgaatgctgataatcaggaaacatgaaj^gaagccatttgatagattattctaaaggatm 
yf caagaagactat taaaaacacct at gcctatact t t tt tatc tcagaaaataaagt caaaagact 

I 
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FIGURE 2 



<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8 .39, ■ NX (S/T) : 0 

MWWFQQGLSFLPSALVTWTSAAFI FSYI TAVTLHHIDPALPYI SDTGTVAPEKCLFGAMLNIAAV 
LCIAT I YVRYKQVHALSPEENVI IKLNKAGLVLGILSCLGLS IVANFQKTTLFAAHVSGAVLTFG 
MGS LYMFVQT I LS YQMQPKI HGKQVFW I RLLL VI WCGVS AL SMLTCS S VLHSGNFGTDLE QKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 
LLSRDI 



m Type II transmembrane domain: 



Important features: 
Type II transmembr 

amino acids 13-33 



^ Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 

|j| N-myristoylation sites. 

P amino acids 57-63, 95-101, 99-105, 124-130, 183-189 

ui . . 
rii 
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FIGURE 3 

CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATACTATTTTTTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGT AT T GAATAGCAGC TC C C G T T AT TTT C ACT G GAAAAT GAACC T GT GT GT AATT CT GC T GAT C CTG G 
TT T TCAT GGTGCCTTT TTACATTGGCTAT TTTATT GT GAGCAATATC CGACTACT GCATAAAGAACGA 
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATT.TCTTCTGGAAACTAGGAGATCCCTTTCC 
Q CATTCT CAGCCCAAAAGATGGGAT CTTAT CCAT AGAACAGCT CATCAGCCGGGTT GGTGT GATT GGAG 

Ct TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
w I CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT 

*t CAT AAGCAAAAAGAAAAGGAT GGCAATGGCACGGAGAACAATGTT CCAGAAGGGGGAAGT GCAT AACA 

111 

m AACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGGAAGTGAAAATCTTACT 

y. CTT ATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCT TTTT CTGGAAACAGCTGATCT 

g ATATGCTACCAAGGAGAGAATAGAATACTCCAAA^CCTTCAAGGGGAAATATTTTAATTTTCTTGGTT 

Cj ACTTTTTCTCTATTTACTGTGTTTGGAAAA]?TTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 

Uf GGGAAA^CGGATCCTGTC^CmGAGGCATTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGT 

H' GAAGTT T TGGT CCCAACACATTTCCTT CATTCT TGT TGGAATAAT CAT CGTCACATCCATCAGAGGAT 

Ul 

~4 TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
?|'| CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAATAC C GCAC CATAATC ACT GAAGT C C T T GGAGAAC T G C AGT T CAAC TT CT AT CACC GT TGGT TT G 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA 
GATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAACAAACAAAATGCTATGGTAGC 
ATT TTTCACCTT CATAGCATACTCCTTCCCCGT CAGGT GATACTATGACCAT GAGTAGCATCAGCCAG 
AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCAT CCCGT GTGGATAT GAGGCTGG 
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAilGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
CTATGGTAGCTGAGCCAAACACGTAGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTG 
CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIG URE A 

MS FLIDSSIMITSQILFFGFGWLFFMRQLirKDYEIRQYWQVIFSVTFAFSCTMFELIIFEILGV 
LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FPI LSPKHGI LS IEQLI SRVGVI GVTLMALLSGFGAWCPYT YMS YFLRNVTDTDI LALERRLLQ 
TMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQETOALEELSRQ 
LFLETADLYATKERIE YSKT FKGKYFNFLGYFFS I YCVWKI FMAT INI VFDRVGKTDPVTRGIE I 
TVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
FVS S VL L I RMSMPLE YRT 1 1 TE VLGE L Q FNFYHRW FDV I FLVS AL S S I L FL YLAHKQAPE KQMAP 

Important features: 
Signal peptide: 

amino acids 1-2 3 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 



AppJD-10063581 



Page 156 



FIGURE s 



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACG ATG GCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCATGCAGAATTATGGGGATCACCCTTGTGAGCAAAAAGGCGAACCAGCAGCTGAATTTCACAG 
AAGC TAAGGAGGCCTGTAGGC TGC T GGGACTAAGT T TGGC CGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGT T T GCAGCC TAT TGT TACAAC TCATCT GATACT T GGACT AAC T CGTGCAT TCCAGAA 
AT T ATCACCACCAAAGAT CCCAT AT T CAACAC TCAAACT GCAACACAAACAACAGAAT T TAT TGT 
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG 
GAAAC TAGC ACC AT G T C T AC AGAAAC T GAAC CAT T T G T T G AAAAT AAAG C AG CAT T C AAG AAT G A 
AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
CAGC TGGT CT T GGAT T T T GCT AT G T CAAAAGG TAT GT GAAGGCCTT C CC T T T T ACAAACAAGAAT 
CAGCAGAAGGAAAT GAT C GAAAC CAAAG TAGTAAAGGAGGAGAAGGC CAAT GATAGCAAC CC TAA 
T GAGGAAT CAAAGAAAACT GATAAAAACCCAGAAGAGT CCAAGAGT C CAAGCAAAAC TACCG TGC 
GAT GCC TGGAAGCTGAAGT T TAGAT G AG AC AG AAAT GAGGAGACACACCTGAGGCTGGTT TCTT T 
CAT GCTCCT TACCCTGCCCCAGCTGGGGAAAT CAAAAGGGCCAAAGAACCAAAGAAGAAAGT CC A 
CCCT TGGT TCCTAACTGGAATCAGCTCAGGAC TGCCAT TGGACTATGGAGTGCACCAAAGAGAAT 
GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
T T C TAG C C TG G C TAT GT C C TAAT AATAT C C CAC T GGGAGAAAG GAG T T T T G CAAAG T G CAAG GAC 
CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
GCTCTGAAAGAGAAACACGTATCCCACCTGACATGTCCTTCTGAGCCCGGTAAGAGCAAAAGAAT 
GGCAGAAAAGTTTAGCCCCTGAAAGCCATGGAGATTCTCATAACTTGAGACCTAATCTCTGTAAA 
GCTAAAATAAAGAAATAGAACAAGGCTGAGGATACGACAGTACACTGTCAGCAGGGACTGTAAAC 
ACAGACAGGGTCAAAGTGTT TTCTCTGAACACATTGAGTTGGAATCACTGT TTAGAACACACACA 
CTTACTTTTTCTGGTCTCTACCACTGCTG^TATTTTCTCTAGGAAATATACTTTTACAAGTAACA 
AAAAT AAAAAC T C T TATAAAT T TC TAT T T TTAT CTGAGT TACAGAAATGAT TAC TAAGGAAGAT T 
ACT CAG TAATT T GT T T AAAAAGTAATAAAAT TCAACAAACAT T TGCT GAAT AGC TACTAT AT GTC 
AAGTGC T G TGCAAGG T AT TACACT C TGTAAT TGAATAT TAT TCCTCAAAAAAT TGCACAT AGTAG 
AACGCTATCTGGGAAGCTATTTTTTTCAGTTTTGATATTTCTAGCTTATCTACTTCCAAACTAAT 
TTTTATTTTTGCTGAGACTAATCTTATTCATTTTCTCTAATATGGCAACCATTATAACCTTAATT 
TAT TAT TAACATACC TAAGAAGTAC AT TGTTACCTCTATATACCAAAGCAC AT T TTAAAAG T GC C 
ATT AACAAATG T AT CAC TAGCCCTC CT T T TTCCAACAAGAAGGGACT GAGAGATGCAGAAAT AT T 
TGT GAC AAAAAAT T AAAGC AT T TAGAAAAC T T 
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FIGURE 6 



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
LSIAGKDQVETALKASFETCSYGWGDGF^ATISRISPNPKCGKNGVGVLIWKVPVSRQFAAYCYN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKIG^ICVTEVmETSTMSTETEPFVENKAilFKNEAAGFGGVPTALLVIALLFFGAAAGLGFCYVK 
RYVKAFPFTNKNQQKEMIETKWKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 

Signal sequence : 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site* 

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



a 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGGACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGGGACCTCGGCTCCAGTCAAGCCCGGCCCG 
G C T C T CAGC TAC CC GC AG GAG GAG G CC ACCC TC AAT G AGAT G T T CCGCGAG G T T GAGGAAC T GAT 
GGAGGACACGCAGCACAAATTGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCTGCTA 
AAGCAT CATCAGAAGT GAACC TGGCAAACT TACC T CCC AGCTATCACAATGAGACCAACACAGAC 
ACGAAGGTTGGAAATAATACCATCCATGTGCACCGAGAAATTCACAAGATAACCAACAACCAGAC 
T GGACAAATGGT C T T T T CAGAGACAGT TATCACAT C TGTGGGAGACGAAGAAGGCAGAAGGAGC C 
ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCGACCAGGGGCAGCAATGGGACCATCTGTGACA 
ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
* j ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
UJ CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
jjl CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
p| GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 
Usk CCAGGAGCT GGAGGACCTGGAGAGGAGCC TGACTGAAGAGAT GGCGC TGGGGGAGCC TGCGGC T G 

CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 
}h TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
fjj CATCTTCTTCCCAGTAAGTTTCCCGTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
Li] TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 
CJ TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTT TGCAAACAT CAACC TGGCAAAAAT G 

f jj Ci^CAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCa\CCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
TT T TCTT GAGGCAT GCACATC T GGAATTAAGG TCAAAC TAAT TCTCACATC CC TCTAAAAG TAAA 
CTACTGT TAGGAACAGCAGTGT TCTCACAGTGTGGGGCAGCCGTCCT TCTAATGAAGACAATGAT 
ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA 
TACAGGT TAACCTGCAGAAACAGTACTTAGGTAATTGTAGGGCGAGGAT TATAAATGAAAT TTGC 
AAAAT CACT TAGCAGCAACTGAAGACAAT TATCAAC CACGTGGAGAAAATCAAACCGAGCAGGGC 
TGT GTGAAACATGGT T GTAATAT GCGAC T GCGAACACT GAAC TCTAC GC CACTCCACAAATGAT G 
TTT TCAGGTGT CAT GGAC TGT TGCCACCATGTAT TC AT CCAGAGT T CTTAAAGT T TAAAGT T GCA 
CAT GATT GTATAAGCATGCT T TCT TTGAGT T T TAAATTAT GTATAAACATAAGT TGCAT T TAGAA 
ATCAAGCATAAATCACTTCAACTGCAAAAi\AAAAAAAAAAAAAAAAAAAAA 
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FIG URE 8 



MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RS AVEEMEAEEAAAKASSEVNLANLPPS YHNETNTDTKVGNNT IHVHRE IHKI TNNQTGQMVFSE 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSBHEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEEI 

Signal sequence: 

amino acids 1-19 



y| N-glycosylation site, 

PI m amino acids 96-100, 106-110, 121-125, 204-208 



Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 

N-myr i s toylation si te . 

amino acids 202-208, 217-223 

Amidation site. 

amino acids 140-144 
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FIGURE Q 



CGG ACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTG<^AATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCTGTTT 
CAAG TGCTGCCTCTG GTGTCT GGAAAAAT T TAT CAAGT T CCTAAACC GCAATGCATACAT CAT GA 
% TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTGATGCTACTCATGCGAAAC 
X ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
III GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
ill ACTTTAAGAGCCCCGACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
m GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
Q AGATTCTGGGCAAGAAGAACGAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCGGG 
\V\ CCCTGATCCAGGACTGCACCCCACCCCCA^CGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
G CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
Jff TT GAGAGGCTGAGGCGGGCGGATCACCT GAGTCAGGAGT T CGAGACCAGCCTGGCCAACAT GGT G 

% AAACCTCCGTCTCTATTAAAAATACAAAA?\TTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GATCGCGCCACTGCACTCCAACCTGGGTGAC^GACTCTGTCTCCAAAACATVAACAAACAAACAAA 
AAGAT T T TAT TAAAGATAT T T T GT TAACTC 
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FIGURE 10 



RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NWVLALGQCVLAGAFAS FYWAFHKPQD I PT FPL I SAFIRT LRYHTGSLAFGAL I LTLVQI ARVI L 
E Y I DHKLRGVQNPVARC IMCC FKCCLWCLEK F I KFLNRNAY IMI AI YGKN FCVS AKNAFMLLMRN 
IVRVWLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites, 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 

amino acids 276-288 

Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
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FIGURE 11 

GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCilGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCGTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
CGAGGGCAAGGTCTTCATCAGCCTCAACCTmCCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 
GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 
GGCAGGCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 
ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAG<3AGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 



AppJD=10063581 



Page 163 



FIGURE 12 



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNS TVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLPWCEEGAGIPTVLQGHIDCGSLLGYI^AVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 
NGFWFFKFL I LVGLTVGAFYI PDGS FTNI WFYFGWGS FLFIL I QLVLL I DFAHS WNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLSIAAVAmMYYTEPSGCHEGKVFISLNLTFCVCVSIAAVLPKV 
QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
S FFHFCLVLAS LHVMMTLTNW YKPGE TRKMI S T W TAVWVKI CAS WAGLLL YLWTLVAPLLLRNRD 
FS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 



CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGTTGGAAAAAGACTCCTGTAACCCTCCTCCAGG ATGA ACCACCTGCCAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGa^GGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
TGAATGGAGGCATTGAGAACACATTAGAGAAGGAGGTGAT GCAGTAT GACTACTATT CT TCATAT 
T T T GAT AT AT T T CT T C T GGCAGT T T T T CGAT T T AAAG T G T TAATAC T TGCATAT GC T G T GT GCAG 
ACTGCGCCATTGGTGGGCAATAGCGTTGA^AACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
% TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
m CTT GCCTGGAT T GAGACGTGGT TCC T GGAT T TCAAAG T G T TAC C T CAAGAAGCAGAAGAAGAAAA 

Ljj CAGACTCCTGATAGTTCAGGATGCTTCAGAGAGGGCAGCACTTATACCTGGTGGTCTTTCTGATG 
yi GTCAGT T TTATT CCCCTCC TGAATCCGAAGCAGGATCTGAAGAAGC T GAAGAAAAACAGGACAGT 

'f* GAGAAACCACT T TT AGAAC TATGAGTACTACT T T TGT T AAATGTGAAAAACCC T CACAGAAAGT C 

ATCGAGGCAAAAAGAGGCAGGCAGT GGAG TCT CCCT GT CGACAGTAAAGT T GAAATGGTGAC GT C 
t*\ CACTGC TGGC T T TAT T GAACAGCTAAT AA&GAT TTAT TTATTGTAAT ACC T CACAAACGT T GTAC 

Ul I CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
Q TCAGTGAGACTGAGCCTGATGTGTTAACA?^ATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 
AAAAGACT TAATATATTGAAGTAACACT T TTT TAGTAAGCAAGATACCTT TTTATTTCAATTCAC 

ul 

Si'i AGAATGGAATTTTTTTGTTTCATGTCTCAGATTTATTTTGTATTTCTTTTTTAACACTCTACATT 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
CTT GCAAAAGGGGAAGAAAGGAAT T GCGAA.TACAT GTAAAATGTCACCAGACAT T TGTAT TATT T 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGACTTAAACCATTCATATCATGTTTCCTTTGCGTTCAGCCAATTTCAATTAAAATGAA 
CTAAATTAAAAA 
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FIGURE U 

MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLW 1 I ELNVNGGIENTLEKEVMQYDYYS S YFD I FLLAVFRFKVL ILAYAVCRIiRHWWAI ALTT 
AVTSAFLLAKVILSKLFSQGAFGWLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 



ill 

yi 



111 



m 



Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, 146-166 
N-myri s toyl a ti on si te s . 

amino acids 14-20, 78-84, 79-35, 202-208, 217-223 
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FIGURE 15 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCe 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCi\ACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
M CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
*1 GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
%\ TGGATGTGTCCGACAAGCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
£H TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
L|| TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCGGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
\V\ TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
%!i CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
f J CCACGACCACCACAGCCACAGT GCC CACCACGAGGC CC G T GGTGCGGGAGC CCACAGCCT TGTC T 

K' : TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
(S CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCAGCTGCCTCA 
f j ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
P|S GGC CTG TACT G TGAGAGCCAGAT GGGGCAGGGGACACGGCCCAGC CC TACACCAGT CACGCCGAG 

y GCCACCACGGTCCCTGACCCTGGGCATCGj^GCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
P AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
U J GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGG.TCACCCA 
r| GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
^1 GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 
GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGC TGT GTGAC TC TAGTC TT GGC CC CAGGAAGCGAAGGAACAAAAGAAAC T GGAAAGGAAGATGC 
TTTAGGAACATGTTTTGCTTTTTTAAAATATATATATTTATAAGAGATCCTTTCCCATTTATTCT 
GGGAAGATGTT TTTCAAACTCAGAGACAAGGACTT TGGT TTTTGTAAGACAAACGATGAT ATGAA 
GGCCT TT TGTAAGAAAAAATAAAAGAT GAAGTGTGAAA 
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FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNEiRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPET^CVCPLSWFGPWVRESHVTLASPEE 
TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
H f * VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 

tvtqlrpnatysvcvmplgpgrvpegeeacgeahtppavhsnhapvtqaregnlplliapalaav 

pi 

btsst 

llaalaavgaaycvrrgramaaaaqdkgqvgpgagplelegvkvplepgpkategggealpsgse 
cevplmgfpgpglqsplhakpyi 

LI! 

Important features : 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature, 
amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 



hi 

kit 
•I II 
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FIGURE 17 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGAC&GCTC ATG CG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCT.GCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGATGAATCCT TAGAT TCCAAGAC TAC TT TGACAT CAGATGAGT CAGT A 
AAGGACCATACTACTGCAGGCAGAGTAGTTGCTGGT CAAATATT TCTTGATTCAGAAGAATCTGA 
ATTAGAATCCTCTATTCAAGAAGAGGAAGACAGCCTCAAGAGCCAAGAGGGGGAAAGTGTCACAG 
AAGAT AT CAGC TT T CT AGAGT CTCCAAAT CCAGAAAACAAGGACTAT GAAGAGCCAAAGAA&GT A 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
T T TCCTAGATAAGGAGTAT GATGAATG TACATCAGATGG GAGGGAAGAT GGCAGAC TGTGGTGTG 
C T ACAACG TAT GAC TACAAAGCAGATGAA&AG T GGGGCT T T TG TGAAAC T GAAGAAGAGGC TGCT 
%. AAGAGACGGCAGAT GCAGGAAGCAGAAATGATGTATCAAACTGGAATGAAAATCCT TAATGGAAG 

U\ CAATAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGA?VCCATA 
Iff C CAAAGCCCT GGAGAGAGTGT CATATGC T C TT TTAT TT GGTGAT TAC T T G CCACAGAATAT CCAG 

\M GCAGCGAGAGAGAT G TT TGAGAAGCTGAC TGAGGAAGGCT CT CC CAAGGGACAGACT GCT C T TGG 

CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
T TGGAGCTCT TGGGGGCAAT C TAATAGCCCACATGGT T T T GGTAAGTAGAC T T TAGTGGAAGGCT 



jjf AAT AAT AT TAACAT CAGAAGAATT T G TGGT T TATAGCGGCCACAAC T T T TT CAGC T T TCAT GAT C 

Q CAGATTTGCT TGTATTAAGAC CAAATATT CAGT TGAACTT CCT TCAAATTCTTGT TAATGGATAT 

Ul AACACATGGAAT CT ACAT GTAAATGAAAGT T GGTGGAGTCCACAAT T T T TC TT TAAAAT GAT TAG 

r ! i 



T T TGGC T GAT TGCCCCTAAAAAGAGAGAT C T GATAAAT GGC TC T T T T TAAAT T T T C T C TGAGT T G 
GAAT T G T C AGAAT CAT T T T T TACAT TAGAT T ATCAT AAT T T T AAAAAT T T T T CT T TAG T T T T TC A 
AAAT T T T G TAAAT G G T GGC T AT AGAAAAACAACAT GAAAT AT TAT AC AAT AT T T T GCAACAATG C 
CCTAAGAATT GT TAAAAT TCATGGAG T TAT T TGTGCAGAATGACTCCAGAGAGC T C TACTT TCT G 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT TAACTCATTTTTAATAAAATTATGTCTAAGAT TAAAA&AAAAAAAAAAAAAAAAAftAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAZUWkAAAAAAAAAA 
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FIGURE 18 



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRR^^ 

EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
SSQAKALVYYTFGALGGNLIAHMVLVSRL 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosamirioglycan attachment site. 

amino acids 2 67-271 

Microbodies C-terminal targeting signal. 

amino acids 2 99-303 

Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE iq 



a 



AAT TCAGAT T TT AAGCCCAT TCTGCAGTGGAATTTCAT GAACTAGCAAGAGGACAC CAT C T TCT T 
GTATTATACAAGAAAGGAGTGTACCTATO^CACACAGGGGGAAAA ATGC TCTTTTGGGTGCTAGG 
CCT CCTAATCCT CT GT GGT T T TCTGTGGACTCGTAAAGGAAAACTAAAGAT T GAAGACAT CACT G 
ATAAGTACAT T T T TAT CAC TGGATGTGACTCGGGCT TT GGAAACT TGGCAGCCAGAAC T T T T GAT 
AAAAAGGGAT T T CATGTAATCGCTGCCT GTC TGACT GAATCAGGATCAACAGC TT TAAAGGCAGA 
AACCTCAGAGAGACT TCGTAC TGTGCTTCTGGATGT GACCGACCCAGAGAATGTCAAGAGGACT G 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
CCCGGCGT GC T G G C T C C CAC T GAC T G G C T G AC AC TAG AG G AC T AC AG AG AAC C TAT T G AAG T G AA 
CCTGTTTGGACTCATCAGTGTGACACTAAi\TATGCTTCCTTTGGTCAAGAAAGCTCAAGGGAGAG 
■51 TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
\ t j T AT GCAGTGGAAGGT TTCAATGACAGC T TAAGACGGGACATGAAAGC TT T T GGT GT GCAC G T C T C 

ill AT G CAT T G AAC C AG GAT T G T T C AAAAC AAAC T T G G C AG A T C C AG T AAAG G T AAT T G AAAAAAAAC 

Ml TCGCCAT TTGGGAGCAGCTGTCTCCAGACATCAAACAACAATATGGAGAAGGT TACATTGAAAAA 

AG T CTAGACAAAC TGAAAGGCAATAAAT CCTATGT GAACATGGACC T CT CTCCGGTGG TAGAG T G 
m CATGGACCACGCT CTAACAAG T CTCT T CCCTAAGACTCAT TAT GCCGCT GGAAAAGATGCCAAAA 

TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 
GCAGAGC TGGCTAAT CC CAAGGCAGTGTGj^C TCAGCTAACCACAAAT GTC TCCTCCAGGCTAT GA 
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
CATTTAGATCGTGCTTATTTGGATTGCAAi\AGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTAT TTAGGC T TT GCCTGC T T GGTGTGATGTAAGGGAAAT TGAAAGACT TGCCCAT TCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCATCT C T TATC TAAATAT TAAAAGATAAGT CAAC CCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 



i y 
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FIGURE 20 
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MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKAETSERLRTVLLDVTDPENVK^^ 

REP I EVNLFGLI SVTLNML PLVKKAQGRVTNVS SVGGRLAI VGGGYTPSKYAVEGFNDS LRRDMK 
AFGVWSCIEPGLFKTNLADPVKVIEKKLAIW 

LS P WECMDHALTS L FPKTHYAAGKDAKI FWI PLSHMPAALQDFLLLKQKAELANPKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 137-190 and 253-256 
Glycosaminoglycan attachment site. 



Iri amino acids 39-42 

ft** 



N-myri s toylation si tes . 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 



CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GG T GAAGCCAAGAACAGCAT TACTGAT TCCCAAATGGATGATGT TGAAGTT GT T TATACAATTGA 
CATTCAGAAATATATTCCATGCTATCAGCTTTTTAGCTTTTATAATTCTTCAGGCGAAGTAAATG 
AGC AAGCAC T GAAGAAAATATTATCAAATGTCAAAAAGAATGTGG TAGGT T GGTACAAAT TCCGT 
CG T CAT TCAGATCAGAT CAT GACG T T T AGAGAGAGGCT GCT T CACAAAAAC T T GCAGGAGCAT T T 
TTCAAACCAAGACCTTGTTTTTCTGCTATTAACACCAAGTATAATAACAGAAAGCTGCTCTACTC 
AT C GAC TGGAACAT TCC T TATAT AAACC T CAAAAAG GACT T T T TCACAGGGTAC CT T TAGT GGT T 
GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
TT TTAGCCGAGCAGTACAAACACACAGCTC TAAATTTTT TGAAGAAGATGGATCCT TAAAGGAGG 
T ACATAAGATAAAT GAAATGTAT GC T T CAT TACAAGAGGAAT TAAAGAGT ATAT GCAAAAAAGTG 
GAAGACAGTGAACAAGCAGTAGATAAACTAGTAAAGGATGTAAACAGATTAAAA.CGAGAAATTGA 
GAAAAGGAGAGGAGCACAGATTCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGA 
ACAT TTTTCTTTGT CAGGCAT TACGGACCT T T T T T CCAAAT T C TGAAT T T CT TCAT TC AT GTGT T 
AT GTCT T TAAAAAATAGACATGT T T CT AAAAGTAGCTGTAACTACAACCACCAT CTCGAT GTAGT 
AGACAATCTGACCTTAATGGTAGAACACACTGACATTCCTGAAGCTAGTCCAGCTAGTACACCAC 
AAATCATTAAGCATAAAGCCTTAGACT TAGATGACAGATGGCAAT TCAAGAGATCTCGGTTGT TA 
GAT ACACAAGACAAACGATCTAAAGCAAATAC TGG T AGTAG TAAC CAAGAT AAAGCATCCAAAAT 
GAGCAGCCCAGAAACAGATGAAGAAAT TGAAAAGATGAAGGGT TT TGGT GAATATTCACGGTCTC 
CTACAT T T TGATCCTT T TAACCT TACAAGGAGAT T T TT T TAT T TGGCTGAT GGGTAAAGCCAAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TGT TTGCAGTAATACACAGATAAC T CT TAG TGCATT TACT TCACAAAG TACT T T T TCAAACATCA 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
ACACAT TC T T TAGAATTGGAAAAG T GAGACCAGGCACAGTGGCTCACACC TGTAATCCCAGC AC T 
TAGGGAAGACAAGT CAGGAGGAT TGAT TGAAGCTAGGAG TTAGAGACCAGCC TGGGCAACG T ATT 
GAGACCATGT CTAT TAAAAAATAAAAT GGAAAAGCAAGAATAGCC T TAT T T TCAAAATAT GGAAA 
G AAAT T TATAT G AAAAT T TAT C T GAG T CAT T AAAAT T C T C C T T AAG T GAT AC T T T T T T AG AAG T A 
CAT TATGGCTAGAGTT GCCAGAT AAAATGC T GGATATCAT GCAATAAAT T T GCAAAACAT CAT CT 
AAAAT TTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MEGESTSAVLSGHVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEVVYTIDIQKYI 
PCYQLFSFYNSSGEVNEQALKKILSNVKK1WVGWYKFRRHSDQIMTFRERLLHPCNLQEHFSNQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDWDNLTL 
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEEIEKMKGFGEYSRSPTF 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation sites. 

amino acids 75-79, 322-326 

N-myristoylation site. 

amino acids 184-154 

Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 



GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGA.GCCGAGTCCAGCCGGACGAGCGGACCAGCGCAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCCAGGGTTCTACCTGCTGCTCT 
j* * CAAGAGAT GGAGGAGAAGT ACAGCCT GCAAAGT AAAGATGATT T GAAAAGT GTGGTCAGC GAACAGT GCAAT CATTTG 

C | CAAGCTGTCTTTGCTTCACGTTACAAGAAGTTT GA.TGAATT CTTCAAAGAACTACTT GAAAATGCAGAGAAATCCCTG 

G I AATGATATGTTT GTGAAGACAT ATGGCCAT TTAT A.CATGCAAAAT T CT GAGCTAT TT AAAGATCT CTTCGTAGAGTTG 

01 AAACGTTACTAC GTGGT GGGAAAT GT GAAC CT GGA AGAAATGC T AAAT GACTTCT GG GCTC GCCT CCT GGAGCGGATG 

yj TTCCGCCTGGTGAACTCCCAGTACCACTTTACAGATGAGTATCTGGAATGTGTGAGCAAGTATACGGAGCAGCTGAAG 
111 CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
TTAGCGGT T GCGGGAGATGT CGT GAGCAAGGT CT C CGTGGT AAACCCCACAGCCCAGT GTACCCAT GCCCT GTTGAAG 
N ! AT GAT CTACTGCTCC CACT GCCG GGGTCTCGT GACTGTGAAGCCAT GTTACAACT ACT GCTCAAACAT CAT GAGAGGC 

Z TGTTTGGCCAACCAAGGGGATCT CGATTTTGAATGGAACAATTTCATAGATGCTATGCTGATGGTGGCAGAGAGGCTA 

M GAGGGTCCTTT CAACATT GAAT CGGT CAT GGAT CCC ATCGATGTGAAGAT T TCTGAT GCTATTAT GAACAT GCAGGAT 

Ui AATAGTGT T CAAGTGT CTCAGAAGGTTTT C CAGGG AT GTGGAC CCCC CAAGCCCCTC C CAGC TGGAC GAAT T TCT C GT 

'hsi TCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGGAACGCCCAACCACAGCAGCTGGCACT 
kl I AGTTT GGACCGACTGGTTACTGATGT CAAGGAGAA ACTGAAACAGGC C AAGAAAT TCT GGTCCTCCCTTCCGAGCAAC 

P GTTTGCAACGATGAGAGGATGGCTGCAGGAAACGGCAATGAGGATGACTGTTGGAATGGGAAAGGCAAAAGCAG 
1IJ CT GTT TGCAGTGACAGGAAATGGATTAGC CAAC CAGGGCAACAACCCAGAGGTCCAGGT TGACACCAGCAAACCAGAC 

AT ACT GAT CCTT CGT CAAATCAT GGCTCTT CGAGTGATGACCAGCAAGAT GAAGAAT GCATACAAT GGGAACGACGT G 
GACTT CTTT GATATCAGTGAT GAAAGTAGT GGAGA AGGAAGTGGAAGT GGCT GTGAGTATCAG CAGTGCCCTTCAGAG 
TT T GACTACAAT GCCACT GACCATGCTGGGAAGAGTGCCAATGAGAAAGC C GACAGT GCT GGT GT CCGT CCT GGGGCA 
CAGGC CTACCTC CTCACTGTCTT CT GCAT CTT GT TCCTGGTTATGCAGAGAGAGT GGAGATAAT T CTCAAACTCT GAG 
AAAAAGTGT TCAT CAAAAAGTTAAAAGGCACCAGT TATCACTTTT CTACCATCCTAGT GACTTTGCTTTT TAAAT GAA 
TGGACAACAATGTACAGTTTTTACTAT GT GGCCAC T GGTTTAAGAAGT GCTGACTTT GTTTT CT CATT CAGTTTT GGG 
AGGAAAAGGGACT GT GCATTGAGTT GGTT C CTG CT CC CCCAAAC CAT GTTAAACGTGGCTAACAGT GT AGGT ACAGAA 
CTATAGTT AGTT GTGCATT TGT GATTT TAT CACT CT AT TATTT GT TT GTAT GTT TTT T TCTCATTTCGTTT GTGGGT T 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 
T CTGAAATAT TAAAT AGCT GTACAGAAGCAGGTTTTAT TTAT CAT GTTAT CTTAT TAAAAGAAAAAGC CCAAAAAGC 
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FIGURE 24 

MARFGLPALLCTLAVLSAALLAAELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQGST 

CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 

LYMQNSELFKDLFVELKRYYWGKVNLEEMLNDFWARLLEMFRLVNSQYHFTDEYLECVSKYT^ 

QLKPFGDVPRKLKLQVTRAFVAART FAQGLAVAGDWSKVS WNP TAQCTHALLKMI YCSHCRGL 

VTVKPCYNYCSNIMRGCLMQGDLDFEWNNFIDAMLMVAERLEGPFNIESVMDPIDVKISD 

MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 

EKLKQAKKFWSSLPSWCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 

KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 

KSAKEKADSAGVRPGAQAYLLTVFC I LFLVMQREWR 

Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P~loop) . 

amino acids 515-524 

N-glycosylation site, 

amino acids 514-518 

Glycosarainoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230 7 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 



CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
CTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCATTATATTCTTCAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGjfyiTCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
C AG GCT T C TAGGAGATG GC T C CAGGAAG GC GGCCAAG AAT G T GAG T GCAAAG AT T GG T T C C TGAG 
AGCCCC GAGAAGAAAAT TCAT GACAG T GT C T GGGC T GCCAAAGAAGCAGT GCCCCTGTGATCAT T 
T CAAGG GCAAT GT GAAGAAAACAAGACACCAAAGG CACCACAG AAAGCCAAACAAGCAT T CCAGA 

fah gcc tgc cagcaat tt ct caaacaat g tcagctaagaagc t t tgctc tgcc t t t g taggagct ct g 

o agcgcccactcttccaat taaacattctcagccaagaagacagtgagcacacctaccagacact c 

^! ttcttctcccacctcactctcccactgta:ccacccctaaatcattccagtgctctcaaaaagca 

pi? 

r \ tgtttttcaagatcattttgtttgttgctctctctagtgtcttcttctctcgtcagtcttagcct 

yj 

ill gtgccctccccttacccaggcttaggctt^ttacctgaaagattccaggaaactgtagcttcct 

31 agctagtgtcatttaaccttaaatgcaatcaggaaagtagcaa^^ 



AAATGTCAAAAAAAAAAAAAAAAAA 



AppJD=10063581 



Page 177 of 320 



FIGURE 26 

MKVLISSLLLLLPLMLMSMVSSSLNPGVAIIGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 



i„i N-myristoylation sites. 

\3 amino acids 27-33, 46-52 

5 
m 
ui 
in 
m 

M 

a 
m 
a 

y| 
P 

rtj 
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FIGURE 27 



GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGAC&TCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
T GGCCG TGCTGACT CCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGC TC T TCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 
TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 
GAGCCCACCTGGAAACACATTGGGGATGGCTGCTGCCTCACCAGAGAGACCTGGAAGGATCTTGA 
GAACGCCCAGTTCTGCGAAATCCAAATGGJVICGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
T TC CCCAGCC T CCAATTAGAACAAGCCACCCACCAGCC TATC TATC T TCCACTGAGAGGGAC CTA 
GCAGAATGAGAGAAGACATT CAT GT AC CACCTAC TAGT CC CT CTCTC CCCAACC TC TGCCAGGGC 
AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
ATGCCAGAGCAAGACT CAAAGAGGCAGAGGT T T TGT TCT CAAATATT TT T T AATAAATAGACGAA 
ACCACG 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQLADGSMDVWCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEHVAEPYGSWAFMWQQVFEPTW 
KHI GDGCCLTRETWKDLENAQFSE I QMERQPPPLKWLPVGPH IMGKAVKQS FPS S KAL ICS FPS L 
QLEQATHQPIYLPLRGT 

Important features: 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern , 

amino acids 10-32 

N-myristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 



CAATGTTTGCCTATCCACCTCCCCCAA.GCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGC TAGGGGGGCTGCCT TAT TTAAA 
GTGGTTGTTTATGATTCTTATACTAATTTATACAAAGATATTAAGGCCCTGTTCATTAAGAAATT 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAA&AAAAAAAAA 



r='; 



.p 
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FIGURE so 

MLLLTLLL LLLLLKGSCLEWGLVGAQKVS SATDAP I RDWAFFP PS FLCLLPHRPAMTC S QAQPRG 
EGEKVGDG 



Important features: 
Signal peptide: 

amino acids 1-15 



u . Growth factor and cytokines receptors family: 

f;| amino acids 3-18 

5 

til 

m 
m 

a 

II! 

a 
ui 

Q 
III 
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FIGURE 31 

GTT TGAATTCCT TCAAC TATACC CACAGTC CAAAAGCAGACTCACTGTGT C CCAGGCTAC CAGT T 
CCTCCAAGCAAGTCATTTCCGTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACCTCAGCATAGAATTGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
CTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATTGACAGTTGAGCTTTTCCAAATCAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GAT TCCGAGAAT CAT T G TCATGT ACAT GCAAAACGCACTGAAAGAACAGCAGCATGGT GCATTGT 
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
AACCAGAATGCATATACTACAACTGCTAT TAATGGGACAGATT TCTGTACATCAGCAAAAGATGC 
ATT CAAAATC T T GT CCAAGAACT CAAGTC&C T T TACAT CTAT TAAC T GC T T TGGAGAC T TCATAA 
TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
TGGAAACAAATGAT GGAT CGTCAGAAAAGCCC TACT T TAT GGATCAAGAAT T T C TGAG T T TCGT A 
AAAAGGAGC AACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCACT CAT TAAGGAAT GAGGA 
GGGAACAGAAC TCCAGGCCAT TGTGAG ATAGA TACCCAT T TAGGTAT CT GT ACC TGGAAAACAT T 
TCCTTCTAAGAGCCATTTACAGAATAGAAGATGAGACCACTAGAGAAAAGT TAGTGAATT T TTT T 
T TAAAAGACC TAATAAACCC TATTCT TCCTCAAAA 



AppJD-10063581 



Page 183 of 320 



FIGURE 32 

MSGRDTILGLCILALALSLAM^ 

LDTERENMKCVXGFAIVSTGI TAVLLVLI FVLRKRIKLTVELFQI TNKAI SSAPFLLFQPLWT FA 
I LI FFWVLWAVLLSLGTAGAAQVMEGGQVE YKPLSGIRYMWS YHL IGL I WTS E FI LACQQMT I A 
GAWTC YFNRSKNDPP DHP I L S SLS I LFFYHQGT WKGS FL I SWRI PRI I VM YMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FIIFLGKVLWCFTVFGGl^FNYNRAFQWAVPLLLVAFFAYLVAHSFLSVFETVLDALFLCFA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

Important features: 
Signal peptide: 

amino acids 1-2 0 

Putative transmembrane domains: 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 
N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 
N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 3 3 

GTT CGAT TAGC T CCTC T GAGAAGAAGAGAi ! \AAGG T T CT TGGACC TC T CCC TGT T TC TT CC T TAGA 
ATAATTTGTATGGGATTTGTGATGCAGGAi^AGCCTAAGGGAAAAAGAATATTCATTCTGTGTGGT 
GAAAAT T T T T TGAAAAAAAAAT TGC CT TC TTCAAACAAGGGTGTCAT TC T GATATT TATGAGGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
ATTCAAACAAAGAAACGGCAAAGAAGATTAAAAGGCCCAAGTTCACT GTGCCTCAGATCAACTGC 
GATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATGTCCAGCAGGATGCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTTATCCCTACCACGATGGAGAGAATCCTT 
TATCGTCTTAGAAAGTAAACCCAAAAAGGGTGTAACCTACCCATCAGCTCTTACATACTCATCAT 
CGAAAAGTCCAGCTGCCCAAGCAGGTGAGACCACAAAAGCCTATCAGAGGCCACCTATTCCAGGG 
ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
GCCACAGGAGCCAGGAGATGGATCTCTGGTCCACTGCCACCTACACAAGCAGCCAAAACAGGCCC 
AGAGCT GATC CAGGTAT CCAAAGGC AAGATCC T TCAGGAGCT GCC T T CCAGAAACC T GT T GGAGC 
GGATGT CAGC C T GGGAC T TG T T CCAAAAGAAGAAT T GAGCACACAGT CT T TGGAGC CAGTAT CC C 
TGGGAGATCCAAACT GCAAAAT TGACTT GTCG T T T T TAAT TGATGGGAGCACCAGCAT TGGCAAA 
CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
C GGTCCAC TGATGGG TGT TGT CCAGTAT GGAGACAACC CT GC TAC T CACT T TAACC TCAAGACAC 
ACACGAATTCTCGAGATCTGAAGACAGCCATAGAGAAAAT TACTCAGAGAGGAGGACTTT CTAAT 
GTAGGTCGGGCCATCTCCTTTGTGACCAA(3AACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
GACT TGCGAGAGAGTCAGGAAT CAACATT TT C T TCATCAC CAT TGAAGGT GCTGCTGAAAATGAG 
AAGCAGTATGTGGTGGAGCCCAACT TT GCAAACAAGGCCGTGTGCAGAACAAACGGCT TCTACT C 
GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 
TGGAGTTTGGGTTCGACAAGTACAGCAGCj^AGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 
TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
GTCCAAGCCCAACAAGAGGAAGTTAATGATCCTCATCACCGACGGGAGGTCCTACGACGACGTCC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGCTAGAAGTCATTGCCACTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGA 
GT TTGACAACCTCCATCAG TATGT C CC CAGGAT CAT CCAGAACATT T GTACAGAGT TCAAC TCAC 
AGCCTCGGAACTGAATTCAGAGCAGGCAGi\GCACCAGCAAGTGCT GCTT TACTAAC TGACG T GTT 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 
TGATCACAAACGTATAGAATGAGCCAAAAGGCTACATCATGTTGAGGGTGCTGGAGAT TT TACAT 
TT TGACAAT TG TT TT CAAAATAAAT GTT CGGAATACAGTGCAGCCCT TACGACAGGCT TACGTAG 
AGCT TT TGTGAGAT T TT TAAG T TG T TAT T TC T GAT T TGAAC TC TGTAAC CCTCAGCAAGT T T CAT 
T T T T GT CAT GAC AAT GT AG G AAT T G C T G Ai\T T AAAT GTT T AGAAG GAT GAAAAAT AAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 3d 



MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYAS YS S VCGAAVHS GVLDNSGGK I LVRKVAGQ S GYKG S YSNGVQS LS LPRWR 
ES FI VLESKPKKGVTYPSALTYS SSKSPAAQAGET TKAYQRPP I PGT TAQPVTLMQL1AVTVAYA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RSGAPOTWVMVDGWPTDKVEEASRL^^ 

FYSLHVQSWFGLHKTLQPLVKRVCDTDRLj^CSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKSKPNKRKLMI L I TDGRS YDDVRI PAMAAHLKGV I TYAI GVAWAAQE E LEV I ATHPARDHS FF 
VDEFDNLHQYVPRI IQNICTEFNSQPRN 

Important features: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 181-200 

N-gl ycosyl a ti on s i tes . 

amino acids 390-394, 520-524 

N-myristoylation sites . 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
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FIGURE 35 



CCGAGC AC AGGAGATTGCCT GCGT TT AGGAGGTGGCT GCGTTGTGGGAAAAGCT AT CAAGGAAGAAATTGC 
CAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGTTTTAATTAAGCATGGAAT 
ACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATTCCACAGTTTTCTTAGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CT CAGC CT TCCCCACT AC AATGT GAT AGAACGCGT GAACT GGATGT ACT T CT ATGAGT AT GAGC CGAT TT A 
C AGACAAG ACT T TCACT T C ACACT TC GAGAGC ATT C AAAC TGCT CT CAT CAAAATCC ATT T C TG GTC ATT C 
TGGT GACCTCCC AC CCT T CAGAT GTGAAAGCC AGGC AGGCCATT AG AGT T ACT TGGGGT GAAAAAAAGTC T 
TGGTGGGGATATGAGGTTCTTACATTTTTCTTATTAGGCCAAGAGGCTGAAAAGGAAGACAAAATGTTGGC 
AT T GTC CT T AGAGGAT GAAC ACCT TC T T T ATGGTGACATAAT CCG ACAAGAT T TTT TAGACACAT AT AAT A 
ACCTGACCTTGAAAACCATTATGGCATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATG 
AAGACAGACACTGATGTTTTCATCAATACTGGCAATTTAGTGAAGTATCTTTTAAACCTAAACCACTCAGA 
GAAGTTTTTCACAGGTTATCCTCTAATTGAT^ATTATTCCTATAGAGGATTTTACCAAAAAACCCATATTT 
CT TACCAGGAGT ATCCTT TCAAGGTGTTCCCTCCAT ACTGCAGTGGGTTGGGTTAT AT AATGTCCAGAGAT 
TT GGTGCC AAGG AT CT AT GAAATGAT GGGTC ACGTAAAAC CC ATC AAGT T TGAAGATGTT TATGT CGGGAT 
CTGTTTGAATTTATTAAAAGTGAACATTCAT^lTTCCAGAAGACACAAATCTTTTCTTTCTATATAGAATCC 

at ttggatgtct gt caac tgag ac gt gt gatt gcagcccatggctttt ct tccaagg agatcatc acttt t 
tggcaggtcatgctaaggaacaccacatgccattattaacttcacattctacaaaaagcctagaaggacag 
gataccttgtggaaagtgttaaataaagtaggtactgtggaaaattcatggggaggtcagtgtgctggctt 
acactgaactgaaactcatgaaaaacccagac:tggagactggagggttacacttgtgatttattagtcagg 

CCCTTCAAAGATGATATGTGGAGGAATTAAATATAAAGGAATTGGAGGTTTTTGCTAAAGAAATTAATAGG 
ACCAAACAATTTGGACATGTCATTCTGTAGACTAGAATTTCTTAAAAGGGTGTTACTGAGTTATAAGCTCA 
CTAGGCTGTAAAAACAAAACAATGTAGAGTTTTATTTATTGAACAATGTAGTCACTTGAAGGTTTTGTGTA 
T ATCTT ATGTGG AT TACCAATTT AAAAAT ATATGTAGT T CTGTGTC AAAAAACTTCT T CACT GAAGTT AT A 
CT GAACAAAATT TT ACCT GTTTTT GGTCATTT AT AAAGT ACTTC AAGAT GTT GC AGT AT T TCAC AGTT AT T 
AT TATTT AAAAT TACT TCAACTTTGT GTTTTT AAATGT TTTGACGATTTCAAT ACAAGAT AAAAAGGATAG 
TGAATCATTCTTTACATGCAAACATTTTCCAGTTACTTAACTGATCAGTTTATTATTGATACATCACTCCA 
T TAATGT AAAGT CATAGGTC ATT ATT GCAT ATC AGT AAT CT CTT GG ACT T TGTT AAAT AT TT T ACT GTGGT 
AATATAGAGAAGAATTAAAGCAAGAAAATCTGAAAA 
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FIGURE 36 



MASALWTVLPSRMSLRSLKWSLLLL^ 

TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGDI IRQDFLDTYNNLTLKT IMAFRWTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KP I KFEDVYVG I CLNLLKVNI HI PEDTNLFFLYRI HLDVCQLRRVIAJVHGFS S KE 1 1 T FWQVMLR 
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites, 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 3 7 



CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
TCGTCCTTGCCAAGAGAGTACACAGTCATTAATGAAGCCTGCCCTGGAGCAGAGTGGAATATCATGTGTCGGGAGTGCTGTG 
AATATGATCAGATTGAGTGC GTC T GCCCCGGAAAGAG^GGAAGTCGTGGGT TATAC CATCCC TTGCTGCAGGAATGAGGAGAA 
TGAGTGTGAC TC CTGC C TGAT CCACCCAGGTTGTACCAT C TTTGAAAAC TGC AAGAGCTGCCGAAATGGCTCATGGGGGGGT 
ACCTTGGATGACTTCTATGTGAAGGGGTTCTACTGTGCAGAGTGCCGAGCAGGCTGGTACGGAGGAGACTGCATGCGATGTG 
GCCAGGTTC TGCGAGCCCC AAAGGGTCAGATTT TGTTGGAAAGC TATCCCC TAAAT GC TCACTGTGAATGGACCAT TCAT GC 
TAAAC CTGG GT T TGTC ATCC AAC TAAGAT TTGTCATGTT GAGTC T GGAGTT TGAC T ACATGTGCCAGTATGAC TAT GTT GAG 
GT T C GTGAT GGAGAC AACCGC GATGGCCAGATCAT CAAGC GTGT C TGTGGC AACGAGCGGC CAGC TCC TATCCAGAGCATAG 
GATCCTCACTCCACGTCCTCTTCCACTCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGC 
ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAA.GTGTGCCTGCTTGGCAGGC 

tatac tg ggcagcgc t g tgaaaat c tcc t tgaagaaagaaactgctcagaccc tgggggc c cagtcaatgggtac c agaaaa 
taacagggggccctgggcttatcaacggacgccatgc:taaaattggcaccgtggtgtctttcttttgtaacaactcctatgt 
tct tagt ggcaatgagaaaagaac t tgccagcagaatggagagtggt cagggaaacagcc c at ct gcataaaagcctgc cga 

GAACCAAAGATT TCAGACC TGGT GAGAAGGAGAGT TC TT C CGATGCAGGTTCAGICAAGGGAGACACCAT TACACC AGC TAT 

act cagc ggcc t tcagcaagcagaaactgcagagtgcrcc c taccaagaagccagccc ttc c ct ttggagatc tgccc at ggg 
ataccaacatctgcatacccagctccagtatgagtgc:atctcacccttctaccgccgcctgggcagcagcaggaggacatgt 
c tgaggactgggaagtggagtgggc gggcaccatc ctgcatccctat c tgcgggaaaattgagaacatcact g c t ccaaaga 
cccaagggttgcgctggccgtggcaggcagccatctacaggaggaccagcggggtgcatgacggcagcctacacaagggagc 
gtggttcctagtctgcagcggtgccctggtgaatgagcgcactgtggtggtggctgcccactgtgttactgacctggggaag 
gtcaccatgatcaagacagcagacctgaaagttgttttggggaaattctaccgggatgatgaccgggatgagaagaccatcc 
agagcctacagatttctgctatcattctgcatcccaactatgaccccatcctgcttgatgctgacatcgccatcctga^gct 
cctagacaaggcccgtatc agcac c cgagtccagc ccatc tgcc tcgc tgc c agtcgggatctcagcacttc c t t ccaggag 
tcccacatcactgtggctggctggaatgtcctggcagacgtgaggagccctggcttcaagaacgacacactgcgctctgggg 
tggtcagtgtggtggac tcgc tgc t gtgtgaggagcagcatgaggac c atggcatcccagtgagtgtcact gataac atgt t 
ctgtgccagctgggaacccactgccccttctgatatctgcactgcagagacaggaggcatcgcggctgtgtccttcccggga 
cgagcatctcc tgagccacgc tggcatc tgatgggac tggtcagctggagc tatgat aaaacatgcagccacaggctct cc a 
ctgccttcaccaaggtgctgccttttaaagactggattgaaaga^ 

TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGCCAGGGCTTCTGACTTCAGGGAC^AAACTCAGTGAAGGGTGAGTAGACCTCCATTGCTGGTAGGCTGATGCCGCGTCCA 
C TACTAGGAC AGCCAAT TGGAAGAT GCCAGGGC TTGCAAGAAGTAAGT T TC T TCAAAGAAGACCATATACAAAAC CT C T CC A 
C TCCACTGACC TGGT GGTC TTCC C CAAC T TTCAGT TATAC GAATGCC AT CAGC TTGACCAGGGAAGATCTGGGC TTCATGAG 
GCCCC TT TTGAGGCTC TCAAGTTC TAGAGAGCTGCC TGTGGGACAGC CC AGGGCAGCAGAGCTGGGATGTGGT GCAT GCC T T 
TGTGTACATGGCCIACAGTACAGTCTGGTCCTTTTCCTTCCCCATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCT 
GAACTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 





FIGURE 5*8 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCOTGSWGGTLDDFYWGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQ^^ 

KRVCGNERPAP I QS I GS S LHVL FHSDGS KNFDGFHAI YEE I TACS S SPC FHDGTCVLDKAGS YKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGEWSGKQPICIKACREPKISDLTORRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALWERTVWAAHCVTDLGKVTMIKTADL 
KVVLGKFYRDDDRDEKTIQSLQISAIILHPNYDPILLDADIAILKLLDKARISTRVQPICLAASR 
DL S T S FQE S H I T VAGWNVL AD VRS P G FKND T L R S GWSWD S L LCE E QHE DH G I P VS VT DNMFC A 
SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature, 
amino acids 260-272 
N-glycosylation sites, 

amino acids 96-100, 279-283, 316-320, 451-455/ 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



G GTTC CTACATC CTCTCAT CTGAGAATCAGAGAGCAT AAT CTT CTTAC GGGCCC GTGATTTATTAACGT GGCTTAATC 
T GAAGGTTCTCAGTGAAATTCTT TGTGAT CTACT GATT GT GGGGGCAT GGCAAGGTTT GCTT AAAGGAGCTT GGCTGG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAGAATGAAGGCGCTTCTGTTGC 
T GGTCT TGCCTT GGCT CAGTCCTGCTAACTACATT GACAAT GTGGGCAAC CTG CACTT CCTGTAT TCAGAACTCT GTA 
AAGGT GCCT CCCACT ACGGCCT GACCAAAGATAGGAAGAGGCGCTCAC AAGAT GGCT GTC CAGACGGCT GT GCGAGCC 
T CACAGCCACGGCTCCCTCCCCAGAGGTTTCTGCAGCTGCCACCATCT CCTTAAT GACAGACGAGCCT GGCCTAGACA 
ACCCTGCCTACGTGTCCTCGGCAGAGGACGGGCAGCCAGCAATCAGCCCAGTGGACTCTGGCCGGAGCAACCGAACTA 
GGGCACGGCCCTTTGAGAGATCCACTATTAGAAGCi\GATCATTTAAAAAAATAAATCGAGCTTTGAGTGTTCTTCGAA 
GGACAAAGAGCGGGAGTGCAGTTGCCAACCATGCCGACCAGGGCAGGGAAAATTCTGAAAACACCACTGCCCCTGAAG 
TCTTTCCAAGGTTGTACCACCTGATTCCAGATGGTGAAATTACCAGCATCAAGATCAATCGAGTAGATCCCAGTGAAA 
GCCT CT CT AT TAGGCTGGT GGGAGGTAGCGAAAC C CCACT GGT C CATAT CAT TAT CCAACACAT T TAT C GT GAT GGGG 
T GATCGCCAGAGACGGCCGGCTACTGCCAGGAGACAT CAT T CTAAAGGT CAACGGGAT GGACAT C AGCAAT GT C C CT C 
ACAACTACGCTGTGCGTCT CCT GCGGCAGCCCTGCCAGGTGCTGTGGCTGACT GT GAT GCGT GAACAGAAGTTCCGCA 
GCAGGAACAATGGACAGGC CC C GGAT GCCT ACAGACCCC GAGAT GACAGCTTTCATGT GATT CT CAACAAAAGTAGCC 
CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 
GT GT GGCAT ATCGACAT GGT CAGCTT GAGGAGAATGACCGT GT GTT AGC CATCAATG GACAT GAT CTT C GATAT G GCA 
GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGT(3AAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 
GGAGCCCTGACATCTTTC^GGAAGCCGGCTGGAAC^GCAATGGCAGCTGGTCCCC^^ 
S*jj CT CCCAAGC CC CTCCAT CCTACAATTACTT GTCATGAGAAGGTGGTAAAT AT CCAAAAAGACCC C GGT GAAT CTCTCG 

GCATGACCGT C GCAGGGGGAGCAT CACATAGAGAAT GGGAT TT GCCT ATCTATGT CAT CAGT GTT GAGC CCGGAGGAG 
% jj T CATAAGCAGAGATGGAAGAATAAAAACAGGT GACi\T TTT GTT GAAT GTGGATGGGGT CGAACT GACAGAGGTCAGCC 

f j GGAGT GAGGCAGTGGCATTATTGAAAAGAACATCATCCTCGATAGTACTCAAAGCTTTGGAAGT CAAAGAGTATGAGC 

%l CC CAGGAAGACT GCAGCAGCCCAGCAGC CCTGGACT CCAAC CACAACAT GGCCCCAC CCAGT GACTGGT CC C CAT C CT 

p; | GGGTCATGT GGCTGGAATTACCACGGTGCTTGTATAACTGTAAA.GATATT GTATT AC GAAGAAACACAGCT GGAAGTC 

III 

T GGGCTTCT GCATT GTAGGAGGTTAT GAAGAATACAATGGAAACAAACCT T TT T T CAT CAAAT CCATT GTT GAAGGAA 
CACCAGCATACAATGATGGAAGAATT AGAT GTGGTGATAT TCTTCTTGCT GTCAAT GGTAGAAGTACAT CAGGAAT GA 
TACAT GCTT GCT TGGCAAGACT GCT GAAAGAACTTj^AAGGAAGAATTACT CTAACTATTGT TTC TTG GC CT GGCACTT 
T TT TATAGAAT CAAT GAT GGGT CAGAGGAAAACAGAAAAAT CACAAATAGGCTAAGAAGTT GAAACACTAT AT TT ATC 
T T GT CAGTT TTTATAT TTAAAGAAAGAATACATT GT AAAAATGT CAGGAAAAGTATGATCAT CTAATGAAAGCCAGTT 
ACAC C TCAGAAAATAT GAT TCCAAAAAAAT TAAAACT ACTAGTTTTT TTT CAGT GTGGAG GATT T CT CATTACT CTAC 
AACAT TGTT TATATTT TTT CTATTCAATAAAAAGCC CTAAAACAACTAAAATGATT GATTT GTATACCC CACTGAATT 
CAAGCTGAT TTAAATT TAAAATTTGGTATAT GCT GAAGT CT GCCAAGGGT ACATT AT GGCCATT TT TAAT TT ACAGCT 
AAAATATTT TTTAAAATGCATT GCTGAGAAACGTT GCTTT CAT CAAACAAGAATAAATATTT TTCAGAAGTT AAA 
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FIGURE AO 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRSFKKINR 
ALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV 
GGSETPLVHIIIQHIYRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDS FHVI LNKS S PEEQLG I KLVRKVDEPGVFI FNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAMLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSP7EGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 . 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE dl 



f B 1 



ACCAGGCATTGTATCTTCAGTTGTGATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGAAGCTTT 
CTT GCCT GCAGTGAAGCAGAGAGATAGATATTATT CACGTAATAAAAAACATGGGCTTCAACCT GACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
C AC CAGT AACT ACTT C GTG G GTG C CAT T CAAGAGAT T CCTAAAG C AAAGGAGT TCAT GG C TAAT T TCC 
AT AAGAC CCTCAT TTTGGGGAAGGGAAAAACT CTGACTAAT GAAGCATCCACGAAGAAGGTAGAACTT 
GACAACTGTCCTTCTGTGTCTCCTTACCTCilGAGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCAC 
T T T GG AAGAGG T ACAG GCAGAAAAT CC C AAAGT GT CC AGAG GCC G GT AT CG C C CT CAGGAAT G T AAAG 
CTTTACAGAGGGTCGCCATCCTCGTTCCCCilCCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
U CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 
Q T AAAAAGT T TAAT C G AGC C AAAC T C T T G AAT G T GGG C TAT C T AGAAGC C C T C AAG GAAG AAAAT T GG G 

ACT GCT T TATAT TCCACGAT GTGGACCTGGTAC CCGAGAAT GACT TT AACC T TTACAAGTGTGAGGAG 
CATCCCAAGCATCTGGTGGTTGGCAGGAAa^GCACTGGGTACAGGTTACGTTACAGTGGATATTTTGG 
% GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 
m G GG GAGG CG AAGACG AT GAC CTCAGACTC AGG G T T GAGCT C CAAAGAAT GAAAAT TTCCCGGCCCCTG 

Ufc CCTGAAGTGGGTAAATATACAATGGTCTTCCACACTAGAGACAAAGGCAATGAGGTGAACGCAGAACG 
? GAT GAAGCT CT TACAC CAAGT GT C ACGAGTCT GGAGAACAGAT GGGT T GAGTAGT T GTT C T TAT AAAT 

Q TAGTATC TGTGGAACACAAT CCTTTATATATCAACATCACAGTGGAT T TCT GGTTT GGTGCATGACCC 

hn T GGATCT TT T GGT GAT GT T T GGAAGAAC T GAT T CT T T GT T T GCAAT AAT TT T GGC CTAGAGAC T T C AA 

\l \ ATAGTAGCACACATTAAGAACCTGTTACAGCTCAT TGTT GAGCTGAAT TTT TCCT TTTTGTATT TTCT 

%\ TAGCAGAGCTCCTGGT GATGTAGAGTATAAAACAGTTGTAACAAGACAGCT TTCTTAGTCATTT TGAT 

fjj CATGAGGGTTAAATATTGTAATATGGATACTTGAAGGACTTTATATAAAAGGATGACTCAAAGGATAA 
AAT GAAC GC TAT T TG AGGAC T CT GGTTGAAG GAGAT T TAT T TAAAT T T GAAGTAAT ATAT TATG GGAT 
AAAAGGC CACAGGAAATAAGACTGCTGAATGTC TGAGAGAACCAGAGTTGT T CTCGTCCAAGGT AGAA 
AGGTACGAAGATACAATACT GTTATTCAT TTAT CCTGTACAATCATCTGTGAAGTGGTGGTGTCAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACACAGTGAACTTGGGAATGAAGA 
GGT AGCAGGAGGGTGGAGT GT CGGC TG C AAAGGCAGC AGTAGC T GAGCT GGT TGCAGGT GCTGAT AG C 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
TTT TAAAGAGTTTTTGTAAAATGATTTT GTACAAGTAGGATATGAAT TAGCAGTTTACAAGTTTACAT 
ATT AACTAATAATAAATATGTCTAT CAAATACC TCTGTAGTAAAATGT GAAAAAGCAAAA 
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FIGURE 42 

MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQE I PKAKE FMAKFHKTLILGKGKTLTN 
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLSEVQAENPKVSRGRYRPQECKMjQRVAILVPH 
RNREKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
SVEHNPLYINITVDFWFGA 

Important features: 
Signal peptide: 

% I amino acids 1-27 

y] N-glycosylation sites, 

ll! amino acids 4-8, 220-224, 335-339 

u 

Xylose isomerase protexns. 

amino acids 191-202 

Q 

. ... 
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FIGURE A3 



Lfl 
m 



Lfl 

r\ 

yj 

,■33 » 

rii 



GCT CAAGACCCAGCAGT GGGACAGCCAGACAGACGGCACGATGGCAC TGAGCTC CCAGAT CTGGG 
CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 
ACGGGACAACTTGCAGAGCTGCASlCCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 

gttccagaggcgaaggaggcgagacacccacttccccatctgcattttctgctgcggctgctgtc 

atcgatcaaagtgtgggatgtgctgcaag^cgtagaacctacctgccctgcccccgtcccctccc 

ttccttatttattcctgctgccccagaac^taggtgttggaataaaatggctggttcttttgttt 

tccaaaaaaaaaaaaaaaaaaaaiy\aaiulaaaaaaaa 

aaaaaaaaaaaaaaaaaaaaaaaa^aaap, 
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FIGURE aa 

MALSSQIWAA.CLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
C I FCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myristoylation site. 

amino acids 4 4-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 
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FIGURE 4* 



GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAAT ATG GCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACCTT CAACACAACCCCTCT T GTCACCATACAGCCAGAAGGGGGCACT AT CATAGTGAC CCAAA 
AT C G TAATAGGGAGAGAGTAGACT T CCCAGATGGAG GC TACT CCC T GAAGCTCAGCAAAC TGAAG 
AAGAATGACT CAGGGAT C TAC TAT GTGGGGATATACAGCT CATCAC T CCAGCAGCCC TCC AC CCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
AT AAGAATGG CACC TGTG TGACCAATCT GACATGCT GCAT GGAACAT GGGGAAGAGGATG T GAT T 
M t TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 

jjj CTGGAGATGGGGAGAAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACT 

Sf TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 

tit 

i : i GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 

It] GT T TC TGAAGAGAGAGAGACAAGAAGAG TACAT TGAAGAGAAGAAGAGAG T GGACAT T TGT CGGG 

CI J AAAC TCC TAACATAT GCCCCCAT TC TGGAGAGAACACAGAGTACGACACAATCCCT CACAC TAAT 

r AGAACAATCCTAAAGGAAGATCCAGCAAATACGGT TTACTCCACTGT GGAAATACCGAAAAAGAT 

^ G G AAAAT CCC CAC T C AC T GC T CAC GAT G CC AGACAC AC CAAGGCTAT T T G C C TAT G AG AAT GT T A 

jl'f TC TAGA CAGCAGTGCACTCCCCTAAGTCTCTGCTCA 

UJ 

r'- s ! | 
KM ,7 

111 
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FIGURE 46 

MAGS PTCLTLI YI LWQLTGSAASGPVKELVGSVGGAVT FPLKSKVKQVDS IVWTFNTTPLVT IQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQSYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKAIjGQAANESHNGSILPISWRWGESDMTFIC 
VAI^PVSRNFSSPIIARKLCEGAM)DPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 
EKKRVDICRETPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIPKKMENPHSLLTMPDT 
PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE an 

GGCTCGAGCGT T TCTGAGCCAGGGGTGACCATGACC TGCT GCGAAGGATGGACAT CCT GCAAT GG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGG T CT GATGGCCAT TCCAGCAACAACAATGT CCT TGACAGCAAGAAAAAGAGCGTG 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAAT T G T GAAT T T T CAT TG AAAAAC AT CAG T GACAT T CAT CCAGAAT C C T T CAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
H ? ACACCATGGCGAGT GGC T GGAGAGCAT C T AGT T T CC AC T T CGAT TC T GAAGAAAACAAAC AT AGG 

W CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
% CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
T i T GTAGT T TAATGGGAATAfiAATGTAAGTAT CAGTAGT T T GAAARAA A A A A A 

rfcS i 
: I? 



CI 

ui 
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FIGURE AS 

MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 
T TMSLTARKRACCNNRTGMFLS S FFSVI TVTGALYCML I S IQALLKGPLMCNS PSNSNANCE FSL 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II},. 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 30-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIG URE J.Q 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAACCTGCAGAGTCACACAGTTGCTGACCAATTGAGCTGTGAGCCTGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATT T GCCAATAAAGACGATCCC TT CT AC TATGACT GGAAAAACC T GCAGC T GAGC GGAC TGAT C 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
C AGCCAGAAGCAGCACAGTCCT GTACCTGAGAAGGCCATC C CAC TCAT CAC T CCAGGC T C TGCCA 
CTACTTGCT^GCACAGGACTGGGCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
^ TCCTCCCCTGGGAGGCCTTATCCTCAAGGrAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
2.* T T CT GAT C AGGAGGCT TC T T TAT GAAT T£ AAC T C GC CC CACCACC CC C T CA 



hi ■>. 



Ml 
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FIGURE so 

MERVTLALLLLAGLTALEANDP FANKDDPFY YDWKNLQLSGL I CGGLLAI AG I AAVLSGKCKYKS 
SQKQHSPVPEKAIPLITPGSATTC 

Important features: 
Signal peptide: 

amino acids 1-16 



N-myristoylation sites. 



Transmembrane domain: 

amino acids 36-59 

m 
Q 
m 

amino acids 41-47, 45-51, 84-90 

1 

O'l Extracellular proteins SCP/Tp:x-l/Ag5/PR-l/Sc7 . 

r " amino acids 54-67 

z 

Q 

m 
o 

o 
m . 
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FIGURE 51 

GTGGACTCTGAGAAGCCCAGGCAGTTGAGGACAGGAGAGAGAAGGCTGCAGACCCAGAGGGAGGG 
AGGACAGGGAGT CGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGC GGCAAGG 
AGGAGACCC TGGT GGGAGGAAGACAC T CTGGAGAGAGAGGGGGCTGGGCAGAGATGAAGTTC CAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCMTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
AG C AGAT G C T T T GG G C AACAG GG T C G G G GAAG CAG C C CAT GC T C T GG GAAACAC T GGG CAC GAGA 

TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 
GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 
GAAACTCAGCAGGCAGCTTTGGAATGAATCCTCAGGGAGCTCCCTGGGGTCAAGGAGGCAATGGA 
GGGCCACCAAAC T T TGGGACCAACAC T CAGGGAGCTGTGGCCCAGCCTGGCTATGGT TCAGTGAG 
AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 
ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 
AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 
TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 
CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 
CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
CTTCAGAGGACAGGGAGTTTCCAGCAACATGAGGGAAATAAGCAAAGAGGGCAATCGCCTCCTTG 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAAT T T TAAATC CAAGC T GGGT T T CAT CA^C TGGG ATGCCAT AAACAAGGACCAGAGAAGC T C T C 
GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA. 
AACACCACCC TCTCATCACTAATCT CAGCCCT TGCCCT TGAAATAAACCTTAGCTGCCCCACAAA 

MAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 52 



MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHE I GRQAEDVIRHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNWNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSmREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGWTVNSETSPGM 
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 



M Signal peptide: 

2; amino acids 1-21 

81 



N-glycosylation site. 

amino acids 2 65-269 

Glycosaminoglycan attachment site . 

amino acids 235-239, 237-241 r 244-248, 255-259, 324-328, 388-392 

Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 

N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE 53 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGG ATG TCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTAT.CACCAATGCCTCAGCTGCCA 

, TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 

pi » 

Q AG T GGC GG T GAC AAG T GGAGC CG CC AC CG T CGGATG C T G AC GCCC GC C T T C CAT T TCAACAT CC T 

0 GAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACATCATGCTTGACAAGTGGCAGCACCTGG 

l f] c c t cagagggc agc agtcg t c t ggac at g t t tgagc acat c agcc t cat gacc t t ggacagt c t a 

cagaaatgcatcttcagctttgacagccattgtcaggagaggcccagtgaatatattgccaccat 
m cttggagctcagtgcccttgtagagaaaagaagccagcatatcctccagcacatggactttctgt 
j*b attacctctcccatgacgggcggcgcttccacagggcctgccgcctggtgcatgacttcacagac 
* gctgtcatccgggagcggcgtcgcaccctccccactcagggtattgatgattttttcaaagacaa 
J=:f agccaagtccaagactttggatttcattgatgtgcttctgctgagcaaggatgaagatgggaagg 
K ; | cattgtcagatgaggatataagagcagaggctgacaccttcatgtttggaggccatgacaccacg 
ri] gccagtggcctctcctgggtcctgtacaaccttgcgaggcacccagaataccaggagcgctgccg 
y acaggaggtgcaagagcttctgaaggaccgcgatcctaaagagattgaatgggacgacctgg^ 
ill agctgcccttcctgaccatgtgcgtgaaggagagcctgaggttacatcccccagctcccttcatc 
tcccgatgctgcacccaggacattgttctcccagatggccgagtcatccccaaaggcattacctg 
cctcatcgatattataggggtccatcacaacccaactgtgtggccggatcctgaggtctacgacc 
ccttccgctttgacccagagaacagcaaggggaggtcacctctggcttttattcctttctccgca 
gggcccaggaactgcatcgggcaggcgttcgccatggcggagatgaaagtggtcctggcgttgat 
gctgctgcacttccggttcctgccagaccacactgagccccgcaggaagctggaattgatcatgc 
gcgccgagggcgggctttggctgcgggtggagcccctgaatgtaggcttgcagtgactttctgac 
c catccacctgt tt t t t tgcagatt gt cat gaataaaacgg t gct gt caaa 
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FIGURE zj. 

MSLLSLPWLGLRPVAMS PWLLLLLWGSWL1ARILAWTYAFYNNCRRLQCFPQPPKRNWFWGHLG 
LITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNASAAIAPKDNLFIRFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKZ^LSDEDIRAEADTF 
MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTWPDPEVYDPFRFDPENSKGRSP 
LAF I P FS AGPRNC I GQAFAMAEMKWLALMLLH FRFLP DHTE PRRKLE L IMRAEGGLWLRVE PLN 
VGLQ 

Important features: 
Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature, 

amino acids 4 61-471 

N-glycosylation sites, 

amino acids 112-116, 168-172 
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FIGURE an 



ATCGCATCAATTGGGAGTACCATCTTCCTCATGGGACCAGTGAAACAGCTGA&GCGAATGTTTGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACT TGCACT TATCTTCT GCATTT TGCAGTCT TTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAAT TCATGGCCAG TTT TATGAAGC T T T GGAAGGCACTATGGACAGAAGCTGG T GGACAG T TT T 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
G T GAT TC GAACAT T TGAGGG T T AC TTT T GGAAGCAACAAT ACAT TC T CGAACC TGAAT G T GAG T A 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
T CT GGAT G T TG T CC CAC T GAAT TCCCAT GAATACA?YACC T AT TCAGCAACAGCAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAZ\AAAAAAAA^ 
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FIGURE 56 



MGPVKQLKRMFEPTRLIATIMVLLCFALTLCSAFWWHNKGLALI FCILQSLALTWYSLSFI PFAR 
DAVKKC FAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen -binding domain protein. 

amino acids 30-72 
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FIGURE 57 



C GGCT CGAGCT C GAGC CGAATCGGCT CGAGGGGCAGT GGA.GCACCCAGCAGGC CGCCAAC ATGC T CTGTCT GTGCCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
ATTTT CAAGCTCAGT GTCTTCAT CC CCTCCCAGGAATTCT CCAC CTACCGC CAGT GGAAGCAGAAAATTGTACAAGCT 
GGAGATAAGGACCTT GAT GGG GAGCTAGACTTT GAAGAATTTGTCCATTAT CTCCAAGAT CATGAGAAGAAGCTGAGG 
CT GGT GTTT AAGATTTTG GACAAAAAGAAT GAT GGAC GCATTGACGC GCAGGAGATCATG CAGTC CCT GCGGGACTT G 
GGAGT CAAGATATCT GAACAGCAGGCAGAAAAAATTCT CAAGAGCAT GGAT AAAAAC GGCACGAT GAC CAT CGACT GG 
AACGAGTGGAGAGACTAC CACCT CCT CCAC CC C GTGGAAAACAT C CCC GAGAT CATC CTCTACT GGAAGCAT TCCAC G 
AT CTTT GATGTGGGT GAGAATCTAACGGT CCCGGAT GAGTT CACAGT GGAGGAGAG GCAGACGG GGAT GT GGT GGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGTCCATGCCTCCCGCAGCAACAACATGGGCATCGTTGGTGGCTTCACTCAGATGATTCGAGA^ 
T C ACT CTG GCGGGGCAAT GGCAT CAACGTC CTCAAAAT TGC C CCC GAATCAGCCAT C AAAT T CAT GGCCTAT GAGCAG 
ATCAAGCGCCTTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCAGGGGCC 
l ffh AT CGC CCAGAGCAGCATCTACC CAAT GGAGGTCC1 1 GAAGAC C C GGAT GGC GCT GC G GAAGACAGGCCAGTACTCAGGA 

k=j AT GCT GGACT GC GCCAGGAGGAT CCT GGCCAGAGAGG G GGT G GCC GCCTT CTACAAAGGCT AT GT CC C CAACATGCT G 

J-! GGCATCATCCCCTAT GCCGGCAT CGACCTTGCAGTCT ACGAGACGCT CAAGAATGCCT GG C T GCAGCACTAT GCAGTG 

C!j AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
fm CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
f\ TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
yj GCTGTGAGCATCAGCTACGTGGTCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGG TGA CGGGGGGAGGGC 
:|| CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 
\M GT CCT GCT GACC CCAGCAGACC CTCCT GTT G GTT C CAGCGAAGAC CACAGGCATT CCT TAG GGT C CAGGGT CAGCAGG 

CT CCG GGCTCACATGT GTAAGGACAG GACAT TTT CT GCAGT GC CT G C CAAT AGTGAG CTTG GAGC CTGGAGGCCG GCT 
TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
ATAATCCATGATGAAAGGTGAGGTCACGTGGCCTCCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACT^ 
TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
GGCCT GGACCCT GTCAGGATGGGCCCCACCTCAGAACCAAACT CACT GTCC CCACTGT GGCAT GAGGGCAGT GGAGCA 
C CATGTTT GAGGGCGAAGGGCAGAGCGTTT GT GTGTT CTGGGGAGGGAAGGAAAAGGT GTT GGAGGC CTTAATTAT GG 
ACTGTTGGGAAAAGGGT TTTGT C CAGAAGGACAAGCCGGACAAAT GAGCGACTT CTGT GCT T CCAGAGGAAGACGAGG 
GAG(^GGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
GGGAC CAGCCCCACATTCCACTT GTGTCACTGCTT GGAACCTATTTATTTTGTATTTATTT GAACAGAGTT AT GT C CT 
AACTAT TT TTATAGATT TGTTTAATTAATAGCTT GT CATTTT CAAGT T CATTT TTTATTCATATTTAT GTT CATGGT T 
GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 
GTCC^GAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCT GCT GCC CTTGCTTAACAATGCCGGCCAACT GGCGACCT CAC GGTT GCACT TCCATT CCAC CAGAAT GACCT GA 
TGAGGAAAT CTT CAATAGGATGCAAAGAT CAAT GC AAAAATTGTTATATAT GAACATATAACTGGAGT CGT CAAAAAG 
CAAATTAAGAAAGAATTG GACGT TAGAAGTTGT CAT TTAAAGC^GC CTTCT AATAAAGT TGTTTCAAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



App_ID=10063581 



Page 209 of 320 



FIGURE s8 

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKS I FKLSVFI PSQEFS TYRQWKQKI VQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKI LDKKNDGRI DAQE IMQSLRDLGVKI SEQQAEKILKSMDKNG 
TMT I DWNEWRDYHLLHPVENI PE I ILYWKHST I FDVGENLTVPDEFTVEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVmQVHASRSNMdGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAlK 
FMAYEQ I KRLVGS DQET LRI HE RLVAGS LAGAI AQ S S I Y PMEVLKTRMALRKT GQYS GML DCARR 
I LAREGVAAFYKGYVPNMLGI I PYAG I DLAVYE TLKNAWLQHYAVNS ADPGVFVLLACGTMS S T C 
GQLAS YPLALVRTRMQAQAS IEGAPEVTMS SLFKHI LRTEGAFGLYRGLAPNFMKVTPAVS I S YV 
VYENLKI TLGVQSR 

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE so 

GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
T TCCCTGGGGCAGAT CCTCTTCT GGAGCATAAT TAGCATCATCATTAT TCT GGCT GGAGCAAT TG 
CACTCATCATTGGCT TTGGTAT T TCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACAT TGGGGAGGATGGAAT CCTGAGC TGCACT TTTGAACCTGACATCAAACTTTCT GATAT 
CGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCAAAGAAGGCAAAGATG 
AGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTTGCTGATCAAGTGATAGTT 
GGCAATGC C TC T T T G CGGC TGAAAAACG T GCAAC TCACAGATGC TGGCACC TAC AAAT G T TATAT 
CAT CAC T TCTAAAGGCAAGGGGAAT GCTAACC TTGAGT ATAAAACT GGAGCCT TCAGCAT GCCGG 
AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
CAGCCCACAG TGGTC T GGGCATCCCAAGTTGACCAGGGAGCCAAC T TCTCGGAAGTCTCCAAT AC 
CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
T CAACAACACATACTCCT G TATGAT TGA£AAT GACAT TGC CAAAGCAACAGGGGATAT CAAAGT G 
ACAGAAT CGGAGAT CAAAAGGCGGAGT CAC CTACAGCT GC TAAACT CAAAGGCT TC TCT GTGTGT 
CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 
GTGCCTTGGCCACAAAAAAGCAT GCAAAGTCAT TGTTACAACAGGGATCTACAGAACTATT TCAC 
CAC CAG ATAT GACC TAG T T T TATAT T T C T GGGAGGAAAT GAAT T CAT AT C T AGAAG T C T G GAG T G 
AG C AAAC AAGAG CAAGAAACAAAAAGAAGC CAAAAGCAGAAGG C T C C AATAT GAACAAGAT AAAT 
CTATCTTCAAAGACATAT TAGAAGT TGGGAAAATAAT TCATGTGAACTAGACAAGTGTGTTAAGA 
GTGATAAGTAAAATGCACG TGGAGACAAG TGCATCCCCAGATC TCAGGGACCTCCCCC TGCCTGT 
CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 
T G T AATGT TGCT CT GAGGAAGCCCC TGGAAAGTC TATCCCAACATAT CCACATC T TATATT CCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
T CT T CCCAAC TGACAAATGCCAAAGT TGAGAAAAAT GATCATAATT T TAGCATAAACAGAGCAGT 
C GGGGACACCGAT T T TATAAATAAACTGAGCACC T T C T TTTTAAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAA2^AAAAAAAAAA 
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FIGURE 60 



MAS LGQ I L FWS 1 1 S 1 1 1 1 LAGAIAL IIGF'GI SGRHS I T VT TVASAGNI GEDGILSCTFEPDIKLS 
DI VI QWLKEGVLGLVHEFKEGKDELSE QDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGT YKC 
YIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTVVWASQVDQGANF5EVS 
NTSFELNSEIWTMKWSVLYIWTIlNnsfTYSCMIENDIAKATGDIKY 
CVSSFFAISWALLPLSPYLMLK 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 253-281 

N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 
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FIGURE 61 



TGACGTCAGAAT CACCATGGCCAGCTATCC TTACCGGCAGGGC TGCCCAGGAGCTGCAGGACAAG 
CAC CAGGAGCCC CT CC GGG TAGCTACTACCCT GGAC CCCC CAATAGT GGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
lA GG T GGAC TCAGATCACAGTGGCT AT AT CT CCAT GAAGGAGC TAAAGGAGGCCCT GG T CAACT GCA 

H AT T GGTC T TCAT TCAAT GATGAGAC C T GC C T CAT GAT GAT AAACAT GT TTGACAAGAC CAAGT CA 

Q GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
0 '; C CAGCAGT ATGACC GGGAC CGC T CG GGC T CCAT TAG C TAC ACAGAGC T GCAGCAAGC T C T G T CC C 

% AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
m GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
jwfc" GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 
is C CAT GACAGCT T C T CGGAT GC T ATGAC CCAAC CATC T GT GGAGAG T GGAG T GCACCAGGGAC CT T 

T CCTGGCT TC T T AGAGT GAGAGAAGTAT G T GGACAT CTCTTCTTTTCCTGT CCCTCTAGAAGAAC 
%l ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
U\ AAT AGTGAGGAC CGGGGC TGAGGCCACACAGATAGGGGCC TGAT GGAGGAGAGGATAGAAG T TGA 

Q ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
ill AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCAT GAGAGAGATC T GT CT AT GGGACCAGTGGC TTGGAT TCTGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
AT T TGGGGCCAAAAG TCCAGTGAAAT TGTAAGCT TCAATAAAAGGAT GAAACTC TGA 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGY I SMKELKQALVNCNWS S FNDET CLMMINMFDKT KS GRI DVYG FSALWKFI QQWKNLFQQYDR 
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGNIRLS FEDFVTMTASRML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 6s 

CAGG ATGC AGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTGTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
t!! TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
&\ TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
rjpj CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
UJ TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
Wl ATC T GGGG TAT CAATAAT CAACCCCCAGG TACCAGC TGGGGAAATAT TAAT CGG TATCCAGGAGG 

m CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 
GCAGCTGGGGG7\ATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
Q CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
Lfl GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 
M TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
T GC TCATC TC CAAT AAAATAAAAGCACT TATGAAAAAAAAAAAAAAAAAAAAAAAZ\AAA?^AAAAA 

CI 

m AAAAAAAAAAA?\AAAAAAAAAAAAAAAAAAAAAAAAAAAA&AAAAA 
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FIGURE 6a 

MQGRVAGSCAPLGLLLVCLHLPGLFARS IGWEEKVSQNFGTNLPQLGQPSS TGPSNSEHPQPAL 
DPRSNDLMVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPMDSWPPEDPWQI^MMAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-2 6 

Casein kinase II phosphorylation sites, 

amino acids 56-59, 155-158 

N-myr i s toy 1 a ti on sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 6 s 



AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAAGTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARS QHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site, 

amino acids 37-41 

N-myristoylation sites. 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 



ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
U GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
•GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
AGAC TCTGCCCCT GAGGT CAAGAGAGGAT GGGGC TAT T C AC T T T TAT ATAT T TATATAAAAT TAG 
TAGT GAGAT GTAAAAAAAAAAAAAAAAAA 



0 

W . 

UJ 



m 
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FIGURE 68 

MANPGLGLLLALGL P FLLARWGRAWGQ IQTTSANENST VLPS ST SS S S DGNLRPEAI TAI I WFS 
LLAALLLAVGLALLVRKLREKRQTEGTYRPS SEEQFSHAAEARAPQDSKE TVQGCLPI 

Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 



yj N-glycosylation site* 

Uf'f amino acids 36-40 



3 

f' ; j amino acids 86-90 



cAMP- and cQtfP-dependent protein kinase phosphorylation site. 



Tyrosine kinase phosphorylation site. 

amino acids 86-94 

N-myristoylation sites, 
amino acids 7-13, 26-32 
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FIGURE 60 



GCCAGGAATAACTAGAGAGGAACAATGGGGTTATTCAGAGGTTTTGTTTTCCTCTTAGTTCTGTGCCTGCTGCACCAG 
TCAAAT ACT T CCT TCATTAAGCT GAATAATAATGGCTT T GAAGATATTGTCAT TGTT ATAGATC CTAGTGT GCCAGAA 
GAT GAAAAAATAATTGAACAAATAGAGGATATGGT GACTACAGCTT CTACGTACCT GTTTGAAGCCACAGAAAAAAGA 
TT TTTT TTCAAAAATGTATCTATATTAATT CCTGAGAAT TGGAAGGAAAATCCTCAGTACAAAAGGCCAAAACAT GAA 
AACCATAAACATGCTGATGTTATAGTTGCA^ 

GAATGT GGAGAGAAAGGCGAATACATT CACT T CACCCCT GACCTT CTACTTGGAAAAAAACAAAATGAATAT GGACCA 
CCAGGCAAACTGTT TGTCCAT GAGTGGGCTCACCT CCGGTGGG GAGTGTTT GAT GAGTACAATGAAGATCAGCCTT TC 
TACCGT G CT AAGT CAAAAAAAAT C GAAGCAACAA(5GTGTT CCGCAG GTAT CTCT GGTAGAAATAGAGTTTATAAGT GT 
CAAGGAGGCAGCT GTCT TAGTAGAGCAT GGAGAATT GATT CT AGAACAAAACT GTATGGAAAAGATTGT CAATT CTTT 
C CT GAT AAAGTACAAACAGAAAAAGCAT CCATAAT GTTTAT GCAAAGTAT T GATT CT GTT GTT GAATT TT GT AAC GAA 
AAAACC CAT AAT CAAGAAGCT CCAAGCCTACAAAACAT AAAGTG CAATT T TAGAAGTACAT GGGAGGT GAT TAGCAAT 
TCTGAGGATTTTAAAAACACCATACCCATGGTGAGACCACCT CCTCCACCTGT CTT CTCATTGCTGAAGAT CAGT CAA 
AGT^TTGTGTGCTTAGTTCTTGATAAGTCTGGAAGCATGGGGGGTAAGGACCGCCTAAATCGAATGAATCAAGCAGCA 
AAACAT TTCCTGCTG C AGACT GT T GAAAAT GGAT CC TGGGT GGG GATG GTT CACTT T GAT AGTACT GC CACTAT T GTA 
AATAAGCTAATC CAAAT AAAAAGCAGT GATGAAAGAAACACACTCATGGC AGGATT AC C T ACAT AT CCT CT GGGAGGA 
ACTTCCATCTGCTCTGGAATTAAATATGCATTTCAGGTGATTGGAGAGCTACATTCCCAACTCGATGGATCCGAAGTA 
CT GCT GCTGACT GAT GGGGAGGAT AACAC T GCAAGT TC T TGTATT GAT GAAGT GAAACAAAGT GGG GC CAT TGTT CAT 
TTTATT GCTTTGGGAAGAGCT GCTGATGAAGCAGTAATAGAGATGAGCAAGATAACAGGAGGAAGTCATTTTTAT GTT 
TCAGATGAAGCTCAGAACAATGGCCTCATTGATGCTTTTGGGGCTCTTACATCAGGAAATACTGATCTCTCCCAGAAG 
T C CCT T CAG C T C GAAAGTAAGGGAT TAACACT GAATAGTAAT GCCT GGAT GAACGAC ACT GTCATAATT GAT AGTACA 
GT GGGAAAGGACACGTTCTTT CT CATCACATGGAACAGTCTGCCTCCCAGTATTTCTCTCT GGGATCCCAGTGGAACA 
kll ATAATGGAAAATTTCACAGTGGATGCAACTTCCA?AATGGCCTATCTCAGTATTCCAGGAACTGCAA2VGGTGGGCACT 
TGGGCATACAATCTTCAAGCCAAAGCGAACCCAGAAACATTAACTATTACAGTAACTTCTCGAGCAGCAAATTCTTCT 

gtgcctccaatcacagtgaatgctaaaatgaataaggac 

attctacaaggatatgtacctgttcttggagcc^lTgtgactgctttcattgaatcac^gaatggacatacagaagtt 
ttggaacttttggataatggtgcaggcgctgattctttcaagaatgatggagtctactccaggtattttacagcatat 
acagaaaatggc^gatatagcttaaaagttcgggctcatggaggagcaaacactgccaggctaaaattacggcctcca 

CT GAATAGAGCCGCGTACATACCAGGCTGGGTAGI'GAACGGGGAAATTGAAGCAAACCCGCCAAGACCTGAAATTGAT 
C^GGATACTCAGACC^CCTTGGAGGATTTC^GCCGAA^ 

CTTCC CT T GC CT GACCAATAC CCACCAAGT CAAAT CACAGAC CTT GATGC CACAGT T CAT GAGGATAAGATTATT CTT 
Ll| ACATGGACAGCAC CAG GAGAT AATTTT GAT GTTGGAAAAGTT CAAC GTTATAT CAT AAGAATAAGTG CAAGTATT CTT 

GATCTAAGAGACAGTT T TGAT GATGCT CTTCAAGTAAATACTACT GATCT GTCACCAAAGGAG GCCAACTC CAAGGAA 
AGCTTTGC^TTTAAACCAGAAAATATCTCAGAAGAAAATGC^\ACCCACATATTTATTGCCATTAAAAGTAT 
|J AGCAATTTGACATCAAAAGTATCCAACATTGCAC^AGTAACTTTGTTTATCCCTCAAGCAAATCCTGATGACATTGAT 
CCTACACCTACTCCTACT CCTACTCCTACTC CTGATAAAAGT CATAATTCT GGAGT T AAT AT TT CTACGCT GGTAT TG 
T CTGT GATT GGGT CT GTTGT AATT GTT AACTTTATT TTAAGT ACCACCATTTGAAC CTTAACGAAGAAAAAAAT CTT C 
AAGTAGACCT AGAAGAGAGTTT T AAAAAACAAAAC AAT GTAAGTAAAGGATAT TTCT GAAT C TTAAAATT CATCCCAT 
GT GTGATCATAAACTCATAAAAAT AATTT TAAGAT GT C GGAAAAGGATACT TT GATTAAATAAAAACACTCATGGATA 
TGTAAAAACTGTCAAGATTAAAATTTAATAGTTTCATTTATTTGTTATTTTATTTGTAAGAAATAGTGATGAACAAAG 
AT CCT TTTT CATACTGATACCTGGTT GTATATTAT TTGATGCAACAGTTTTCT GAAATGATATTTCAAATT GCAT CAA 
GAAAT TAAAATCATCTATCT GAGT AGT CAAAATAC AAGTAAAG GAGAGCW^TAAACAAC^TTTGGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 



hit 
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FIGURE 7Q 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPElOTKENPQYKRPraENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFMQSIDSVVEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWGMVHFDSTATIWKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFYVSDEAQNNGL I DAFGALT SGKTDLSQKSLQLESKGLTLNSNAWMNDT VI I DSTVGKDT FFL 
I TWNSLPPS I S LWDPSGTIMENFT VDAT SKMAYLS I PGTAKVGTWAYNLQAKANPETLT I TVTSR 
AANSSVPPITVNAKMNKDVNS FPSPMIVYAEILQGYVPVLGANVTAFIESQNGHTEVLELLDNGA 
GADS FKNDGVYS R Y FTAY TENGRYS LKVRAHGGANTARLKLRPPLNRAAY I P GWWNGE I EANPP 
RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRYIIRISAS 1LDLRDSFDDALQVNTTDLSPKEANSKESFAFKPENISEENATHIFIAI 
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VNFILSTTI 

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site* 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



CTCCTTAGGTGGAAACCCTGGGAGTAGAGTACTGACAGCAAAGACCGGGAAAGACCATACGTCCCCGGGCAGGGGTGA 
C7\ACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGACGCCAAGGTAATTTTGACCCA 
GAGGAGCAAT GAT GTAGCCAC CT C CTAACCTTC CCTTCTTGAAC C CCCAGTTATGC CAGGATTTACTAGAGAGT GTCA 
ACTCAACCAGCAAGC GGCT CCTT CGG CTTAACTT (STGGTT GGAGGAGAGAACCTTT GTGGGGCT GCGT TCT CT TAGCA 
GTGCTCAGAAGTGACTTGCCTGAGGGTGGACCAGi\AGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAA 
GGCTGTGATGGGAATGAAGGTGAAAACTTGGAGATTTCACTTCAGTCATTGCTTCTGCCTGCAAGATCATCCTTTAAA 
AGTAGAGAAGCTGCTC TGT GT GGT GGTTAACT C CAAGAGGCAGAACTCGT T CTAGAAGGAAATGGAT GCAAGCAGCTC 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTG AATGA TGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGW^GGAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCAC 
CGCAACTACGTGAGCAGCCT GAAGCGGCAGATCGCACAGCT CAAGGAGGAGCT GCAGGAGAGGAGTGAGCAGCTCAGG 
y t AATGGGCAGTACCAAGCCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAAACCCAGGCCGACCTC 
CTGGCCTTCCTGCACT CGCAGGT GGACAAGGCAGAGGT GAATGCT GGCGTCAAGCT GGCCACAGAGTATGCAGCAGTG 
CCTTTCGATAGCTTTACTCTACAGAAGGTGTACCAGCTGGAGACTGGCCTTACCCGCCACCCCGAGGAGAAGCCTGTG 
f * I AGGAAGGACAAGCGGGATGAGTT GGTGGAAGCCATTGAATCAGCCTTGGAGACCCT GAACAATCCTGCAGAGAACAGC 

CCCAAT CACC GTC CTT ACAC GGCCT CT GATT TCATAGAAGGGAT CTAC CGAACAGAAAGGGACAAAGG GACAT T GTAT 
GAG CT CACCT T CAAAGGGGAC CACAAACACGAAT T CAAACG GCT CATCT TATT T C GAC CAT TCAGC C C CAT CAT GAAA 
Lj J GT GAAAAAT GAAAAGCTCAACAT GGCCAACACGCTTAT CAATGTTATC GTGCCTCTAGCAAAAAGGGT GGACAAGTTC 

pS J CGGCAGTTCATGCAGAATTT CAGGGAGAT GT GC AT T GAGCAG GAT G G GAGAGT CCATCTCACTGTTGTTTACTTT GGG 

M I AAAGAAGAAATAAAT GAAGT CAAAGGAATACTT GJIAAACACT T C C AAAGCT GC CAACTT CAGGAACTTTACCT T CATC 

QjJ CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 

TTTTTCTGTGATGTGGACATCTACTTCACATCTG;iATTCCTCAATACGTGTAGGCTGAATACACAGCCAGGGAAGAAG 
GT ATTT TAT C CAGTT CTTTT CAGT CAGT ACAAT C CT GGCATAATAT AC GGC CACCAT GAT GCAGT CCCT CC CT T GGAA 
Si CAGCAG CTGGT CATAAAGAAGGAAACTGGATTTT GGAGAGACT TT GGAT TT GGGAT GACGT GT CAGTATCGGT CAGAC 

f? TTCATCAATATAGGTGGGTTTGATCTGGACATCA^^AGGCTGGGGCGGAGAGGATGTGCACCTTTATCGCAAGTATCTC 
U lt CACAGCAACCTCATAGTGGTACGGACGCCTGTGCGAGGACTCTTCCACCTCTGGCATGAGAAGCGCTGCATGGACGAG 
j.1 1 CT GACCCCCGAGCAGT ACAAGAT GTGCATGCAGTCCAAGGCCATGAACGAGGCATC CCAC GGC CAGCT GGGCAT GCTG 

%{ GT GTT CAGGCACGAGAT AGAGGCTCACCTT CGCA7 ACAGAAACAGAAGACAAGTAGCAAAAAAACATGAACT CCCAGA 

;'- ! : GAAGGATT GT GGGAGAC ACTTTTTCTT T CCT TTT GCAATTACT GAAAGT GGCT GCAACAGAGAAAAGACTT CCATAAA 

j.] | GGACGACAAAAGAATTGGACTGATGGGTCAGAGAT GAGAAAGCCT CCGATTTCTCT CTGTTGGGCTTTTTACAACAGA 

pi AATCAAAATCTCCGCTTTGCCTGCAAAAGTAACCCAGTTGCACCCTGTGAAGTGTCTGACAAAGGCAGAATGCTTGTG 
J: f AGATTATAAGCCTAAT GGTGT GGAGGTT TTGATGGTGTT TACAAT ACACT GAGAC CT GTT GTTTTGT GTGCTCATTGA 

f | j AATAT T CAT GATT TAAGAGCAGTTT TGTAAAAAATTCATTAGCAT GAAAG GCAAGCATAT TTCT CCTCATATGAATGA 

GC CTAT CAGCAG GGCT CTAGT TT CTAGGAAT GCTAAAATATCAGAAG GCAG GAGAGGAGATAGGCTTAT TATGATACT 
AGTGAGTACATTAAGTAAAATAAAATGGACCAGA^AAGAAAAGAAACCATAAATATCGTGTCATAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTT TTT CCCT TGT GAGTTATAGT CTGCTTATTTAATTACCACTTT GCAAGCCT TACAAGAGAGCACAAGTT GGC CTAC 
ATT TTTATAT TTTTTAAGAAGAT ACTT T GAGAT GC AT TAT GAGAACTTTCAGT TCAAAGCATCAAATTGAT GCCATAT 
CCAAGGACAT GCCAAAT GCT GAT TCT GT CAGGCAGTGAAT GT CAGGCATT GAGACATAGGGAAGGAAT GGTTTGT ACT 
AATACAGACGTACAGATACTTT CTCT GAAGAGTATT TT CGAAGAGGAGCAACT GAACACT GGAGGAAAAGAAAAT GAC 
ACTTTCTGCTTTACAGAAAAGGAAACTCATTCAGACTGGT^ 

CT CCT CAGAAGTAGGGACCGCTTT CTTACCT GTTT AAATAAACCAAAGTATAC CGT GTGAACCAAACAAT CT CTT TTC 

aaaacagggtgctcctcctggcttctggcttccataagaagaaatggagaaaaatatatatatatatatatatattgt 
gaaagatcaatccat ct gccagaatctagtgggat' ggaagttt tt gctacatgttat ccac cccaggccaggtggaag 
taactgaattattttttaaattaagcagttctactcaatcacc^ 

CAAACTATTTTTTAAAAATAAATACAGT TAACATAGAGTGGTT TCTT CATT CATGT GAAAAT TAT TAGCCAGCACCAG 
AT GCAT GAGCTAATTAT CTCT TT GAGT CCTT GCTT CTGTTTG C TCACAGT AAACT CATTGTT TAAAAGCTT CAAGAAC 
AT TCAAGCT GTT GGTGT GTTAAAAAAT GCATT GT ATTGAT TT G TACT GGT AGTTTAT GAAATTTAATTAAAACACAG G 
CCATGAATGGAAGGTGGTATTGCACAGCTA^TAAAATATGATTTGTGGATATGAA 
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FIGURE 72 

MMMVRRGLIAWI SRVWLLVLL^ PTGKEGYQAVLQEWE 
EQHRlSrrV'SSLKRQIAQLKEELQERSEQLPNGQYQASDi^GLGLDRSPPEKTQADLLAFLHSQV^ 
AEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 
ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNMAN 
T L I NVI VPL AKRVDK FRQ FMQNFREMC I EQDGRVHL T WY FGKEE I NE VKG I LENT S KAANFRN F 
TFI QLNGE FSRGKGLDVGARFWKGSNVLL FFCDVDI YFTSE FLNTCRLNTQPGKKVFYPVL FSQY 

NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
KYLHSNL I WRT PVRGLFHLWHEKRCMDE LT PE QYKMCMQSKAMNEAS HGQLGMLVFRHE I EAHL 
RKQKQKTS SKKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites . 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE 73 

GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATTTGTCCTGGGGATCCA 
GAAAC CCAT GATACCC TAC T GAACACCGAATCCCC TGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACAXGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAAT G CC CAGTCGCCCAT CG ATAT TCAGACAGACAG TGT GACAT T TGAC CCTGATT T GCC 
%l TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
%l ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
jjl GCCCAGCTCCACCTGCACTGGGGTCAGASAGGATCCCCAGGGGGGTCAG2\ACACCAGATCAACAG 
III TG AAGCC AC AT T T GCAGAGC TC CACAT T G TACAT TAT GAC T C T GAT T CC TAT GACAGC T T GAG T G 

^ AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
r !' AATATAGCT TATGAACACATTCTGAGTCACTTGCATGAAG TCAGGCATAAAGAT CAGAAGACC TC 

! s AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
Q GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
111 ATT TCAATGGAACAGCTGGAAAAGCT TCAGGGGACATTGTT CT CCACAGAAGAGGAGCCCTCTAA 

W GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
7CCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
fil TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGTGTGGTCT TCACCTCAGCACAAGCCACGAC TGAGGCATAAAT TCC TTC T CAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACAC TG TAGGAGTAGTAAGCAGATG TCCTCCTT CCCCTGGACATC TCT TAGAGAGGAAT 
GGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
C TC C T TAGGATAAAGAGT TGCT GT TGAAGT TGTATAT TT T TGATCAATATATT T GGAAATTAAAG 
TTTCTGACTTT 
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FIG URE 74. 

MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
WFTSAQATTEA 



V: 



r 5 ^ 
f I f 



Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 7 5 

TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTG.TGTC 
TCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGa ATG TCCTACA 
ATGGACT CCACCAGAGGGTC T T CAAGGAG T TAAAGT T AC TT ACAC TGT GCAG TATT TCAT CACAA 
ATT GGCCCACCAGAGG TGGC AC T GACTACAGATGAGAAGT CCAT TTC TGT T G TC CT GACAGC TCC 
AGAGAAGTGGAAGAGAAATCCAGAAGACCTTCCTGTTTCCATGCAACAAATATACTCCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAA^ L TCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAS.CACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
%\ CAGAGTT CAAGGCTAAAAT CAT CT T C TGG TAT G T TT T GCCCATATCTAT TACCGT GT T T CT T TTT 

Q) TCTGTGATGGGCTATTCCATCTACCGATATATCCACGTTGGCAAAGAGAAACACCCAGCAAATTT 
yl • GAT TTT GAT TTATG GAAAT GAAT T T GACAAAAGAT TCTTTGTGCCTGC TGAAAAAAT C G T GAT TA 

4 - ACT T TAT CACCC TC AATAT CT C G GAT GAT TCTAAAAT TT C T CAT CAGGATAT GAG T T T AC TGGGA 

'f* AAAAG CAG T GAT GTAT CCAGCC T TAATGATCC TCAGC CC AGCG GGAAC CT GAGGCCCC C T CAGGA 

' s GGAAGAGGAGGTGAAACATT TAGGGTATGCT TCGCAT TT GATGGAAAT TT T T TGTGAC TCTGAAG 

gj AAAACACGGAAGGTACTTCTCTCACCCAGCAAGAGTCCCTCAGCAGAACAATACCCCCGGATAAA 
111 ACAGT CAT T GAATATGAATATGAT G T CAGAACCAC TGACAT TT GTGCGGGGCCT GAAGAGCAGGA 

^ GCT CAG TTT GCAGGAGGAGGT GTCCACACAAGGAACAT TATT GGAGTCGCAGGCAGCGT TGGCAG 

UJ 

%i TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 

y . 

m CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAAT GAAACCTATCTCATGCAAT TCATGGAGGAATGGGGGT TATA 
TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 
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FIGURE 76 



MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYWSVLNTKSNRTWSQCVTNHTLVI.TWLEPNTLYCVHVESEVPGPPRRAQPSEKQCARTLK 
DQSSE FKAKI I FWYVLPI S ITVFL FSVMGYS I YRYIHVGKEKHPANLI LI YGNEFDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVXGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQe4EEWGLYVQMEN 



rj Important features: 

Qi Signal peptide: 

amino acids 1-28 



Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
C C AAT C T AGTGAAAAACC AGCT GTGTCCCGT GAT G GAG GCTTCCTT CAAT GG CAT GT AT GC AG AC 

CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGT TCAATAACTCT GCAGCT TCCCT GACAAT GCCCACCC TGGACAACATCCCG 
TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
TGATCAATGAAAAGGC TGCAGATAAGCTGGGATCTACCCAGAT CG TGAAGATCC TAAC TCAGGAC 
ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCGTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
TTTACACCAAAGGTGACCAACTTATACTCAACTTGAATAACATCAGCTCTGATCGGATCCAGCTG 
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 

MAGPWTFTLLCGLLAATLIQATLSPTAVI.ILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGGIPVLGSLVNTVLKHI IWLKVI TANILQLQVKPSANDQELLVKI PLDMVAGFNTPLVKT IVE 
FHMTTEAQATIRMDTSASGPTR^^ 

VKNQLC PVI EAS FNGMYADLLQLVKVPI SLS I DRLE FDLLYPAI KGDT I QL YLGAKLLDSQGKVT 
KWFNNSAAS LTMPTLDNI P FSL IVSQDWKAAVAAVLS PEE FMVLLDSVLPESAHRLKS S I GL IN 
EKAADKLGSTQIVKILTQDTPEFFIDQGtEAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQL I LNLNNI S SDRI QLMNSG I GWFQPDVLKNI I TE I IHS I LLPNQNGKLRSGVPVSLVKALG 
FEAAE SSLT KDALVLT PAS LWKPSS P VSQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites* 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 

GAGAGAAG T CAGCC T GGCAGAGAGAC TC TGAAATGAGGGATTAGAGGTGTT CAAGGAGCAAGAGC 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAGTGTGACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCGTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 

f 

f s 1 GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
p ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
fl; CAT GAAAT T T GAGAT TGGAGAGG C T C T T TAC T TGGG C AT TAT TTCTTCCCTGTTCTCCCT GATAG 

* CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
% CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
M GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
$ GG T CT G TGAAAAACAG TGGACAGCAC CCCGAGGGCCACAGGTGAGGGACACTAC CACT GGAT CGT 

W . GTCAGAAGGTGC TGC TGAGGAT AGAC TGAC T T TGGCCAT TGGAT TGAGCAAAGGCAGAAATGGGG 

^ GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
u'i TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCGATTCCCTTAAGCCA 
r| GGACTCAGAGGATCCCTTTGCCGTCTGGTTTACCTGGGACTCCATCCGCAAACCCACTAATCACA 
jll TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGGATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACTAATTTGTGCATGAACTGAAATAAAACCATCCTACGGTATCCAGGGAACAGAAAGCAGGATG 
CAGGATGGGAGGACAGGAAGGCAGCC TGGGACAT T TAAAAAAATA 
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FIGURE 80 



MASLGLQLVGYILGLLGLLGTLVAMLLPSWPCTSS YVGAS IVTAVGFSKGLWMECATHSTGI TQCD 
IYSTLLGLPADIQAAQMWTSSAISSI^.CIISWGMRCTVFCQESRAKDRVAVAGGVFFILGGL 
LG FI PVAWNLHG ILRDFYS PLVPDSMK FE IGEALYLGI I SS LFS L I AG 1 1 LCFSCSSQRNRSNYY 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, . 117-140, 163-182 

N-glycosylation site* 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 46-59 
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FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCCIA.CCCCGCTCTAGCAGCAGACCTCCTGGGGTGTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGC CTGACCCTGGGGAAAGGATGGTTCCCGAGGT GAGGGTCC TC TCCT CCTT GC TGGGA 

CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
CCATGGGAAGAGATACTCCC CCGGCGAGAGCTGGCAC CCC T AC T T GGAGCCACAAGGC CTGATGT 
ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
TGCCT CT GGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
ACAGAGGGCCAGATC TAC T GC GGCC T CACAAC C T GC C C CGAAC CAG GC TG CC CAGC AC C CC T CCC 
ACTGCCAGACTCCTGCTGCCAAGCCTGCAAAGATGAGGCAAGTGAGCAATCGGATGAAGAGGACA 
GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
CAGACCCAAGGGAGCAGGCAGCACAACT G T CAAGAT C GT CC TGAAGGAGAAACATAAGAAAGCCT 
GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CACCGAGTACCCCTGCCGTCACCCCGAGAAft.GTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
C TCGGACT TGGTGGAGATCTACCT CT GGAAGCTGG TAAAAGAT GAGGAAACTGAGGCT CAGAGAG 
GTGAAGTACCTGGCCCAAGGCCACACAGCCAGAATCTTCCACTTGACTCAGATCAAGAAAGTCAG 
GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT 
GGAACGT CT TCCTAGCCCAGAC CC TGGAGCTGAAGG T CACGGC CAGTCCAGACAAAGT GACCAAG 
ACATAACAAAGACC TAACAGT T GCAGATATGAGCTGT AT AAT T GT TGTTAT TAT AT AT TAATAAA 
TAAGAAGTTGCATTACCCTCAAAAAAAAAZ^IAAAAAAAAAAA 



FIGURE 82 



MVPEVRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGE I FSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide : 
amino acids 1-25 
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FIGURE 83 



GACAGCTGTGTC TCGATGGAGT AGACTCTC AiSAACAGCGC AGTT TGCCCTCC GC TC ACGC AGAGCCT C TCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
T CCGTTTCCATGCCGTGAGGT C CATTCACAG^I^ACAC ATCCATGGCTCT C ATGCT CAGTTTGGTT CTGAGTC 
T CCT C AAGCT GGGATC AGGGCAGT GGC AGG T GTT TGGGCCAGACAAGCCTGT CC AGGC CTTGGT GGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAiflGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CCAGTT CTCT AGCGT GGT CCACCT CT AC AGGGAC GGGAAGGACC AGCC ATT T AT GC AG ATGCCACAGTATC 
AAGGC AGG AC AAAACT GGT GAAGGATTCT ATT GCGG AGGGGC GC ATCT CT CT GAGGCT GG AAAAC ATT ACT 
u>s GTGT TGGATGCT GGC C TCTATGGGTGCAGGATTAGTTCCC AGTC TTACT ACC AGAAGG CCAT CT GGGAGCT 

'f\ AC AGGTGTCAGC AC TGGGCT C AGT T CCTCTCATT TCCATC AC GGGAT ATGTT GAT AGAGACATC C AGCT AC 

r\ TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
□1 ACAGACTCCAGGAC AAACAGAGAC ATGC ATGGCC TGTT TG AT GT GG AG AT CT C T CTGACCG T CC AAGAGAA 

Lll CGCCGGGAGC AT AT CCTGTT CC AT GCGGCATGCT CATCTGAGCCGAGAGGTGGAATCC AGGGTAC AGAT AG 

Ul GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 

m 

'?* TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 
?** GAGAAG AAAG CACGGACAG GCAGAATTG AG AGAC GC CCGG AAAC AC GC AGTGGAGGTG ACTCTGGATC CAG 

1- AGACGGC TCACCCGAAGC T CTGCGTTTCTG AT CT GAAAACTGTAACCCAT AGAAAAGC TCCCCAGGAG GTG 

Li 

*«I CCTC AC TCTGAGAAG AGATT T ACAAGGAAGAGTGTGGTGGCT TCTC AG AGTT TC C AAGCAGGGAAAC ATT A 

CTGGGAGGTGGACGGAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGTGGACAGGAGGA 
i s i AGGAGT ACGT GACT T TGTCT CCCGAT CATGGGTACTGGGTCCTC AG ACTG AAT GGAGAAC AT TT GT ATTTC 

Q ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 
fjj TGAGT GTGGGAC CATCTCCT T C TT CAACAT AAATGACCAGTCCCT T ATTTAT AC CCTGAC AT GT CGGTTTG 

AAGGCTTATTGAGGCCCTACATTGAGTATCCC;TCCTATAATGAGCAAAATGGAACTCCCATAGTCATCTGC 
CC AGTC ACCCAGGAAT CAGAGAAAGAGGCCTC TTGGCAAAGGGC CTCTGCAATCCC AGAGACAAGC AACAG 
TGAGTCCTCCTCACAGGCAACCACGCCCTTCCTCCCCAGGGGTGAAATGTAGGATGAATCACATCCCACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CTGAC ATT AC AT TT AGTTTGCTCT C ACTCC AT CTGGCT AAGT GATC TTGAAATACC ACCT CT CAGGTG AAG 
AACCGTCAGGAATTCCCATCTCACAGGCTGTGGTGTAGATTAAGTAGACAAGGAATGTGAATAATGCTTAG 
AT CT T ATT GAT GAC AG AGTGTATC CT AAT GGT TTGTTCATTATATT ACACTT TC AGTAAAAAAA 
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FIGURE 84 

MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSVVH 
L YRDGKDQP FMQMPQYQGRTKLVKDS IAEGRI SLRLENI TVLDAGL YGCRI S S QS YYQKAI WELQ 
VSALGSVPLI S ITGYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVEISL 
TVQENAGSISGSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKS WASQS FQAGKHYWEVDGGHNKRWP.VGVCRDDVDRRKEYVTLS PDHG YWVLRLNGEHL YFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM 

m Signal peptide: 

U| amino acids 1-17 

\J 1 

Transmembrane domain: 

amino acids 239-255 

yi 

! ,i - 

Til - : 
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FIGURE 8 s 

AA-CAGACGXTCCCTCGCGGCCCTGCjCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCCCCT 
GCTCTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGTTCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAATTGCACCCTGAGCATCAGAGATGCCAGAAGAAGTGATGCGGGGAGA 
TACT T CT T TCGTATGGAGAAAGGAAGTATAAAATGGAAT TATAAACATCACCGGCTCT CTG TGAA 
TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
% AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
fi% GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTGACCCTCATCCCACA 
Uj GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 

$1 ACAAG AC C G T C CAT CT C AAC G T G T C C T AC C C G C C T C AGAAC T T GAG CAT G AC T G T C T T C C AAGGA 

fi% 

fl GACGGCACAGTATCCACAGTCTTGGGAM.TGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
Q GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
ijl CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 
i] GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 
% CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
fl] TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 
AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 
ATGAGAAACTGCAGAGACTCACCCTGATTGAGGGATCACAGCCCCTCCAGGCAAGGGAGAAGTCA 
GAGGCTGAT TC TT GTAGAATTAACAGCCC TCAACGTGAT GAGC TATGATAACAC TATGAAT TATG 
TGC AGAGT GAAAAG CACACAGG C T T TAGAG T CAAAG TAT C T CAAAC C T GAATCC AC AC TGT GCC C 
TCCC TT TTAT TTT T T TAACTAAAAGACAGACAAAT T CCT A 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPVVHGYWFREG 
ANT DQDAPVATNNPARAWEETRDRFHLLGDPHTKNCTLS I RDARRS DAGRYFFRMEKGS IKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
L GSQQVYLNVS LQS KAT S GVTQGWGGAG ATALVFLS FCVI FWVRS CRKKS ARPAAGVGD T G I E 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 

AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CCAAGGAAAGTGCAGC TGAGAC TCAGACAAGAT TACAATGAACCAAC TCAGC TTCCTGCTGTTTC 
TCATAGCGACCACCAGAGGATGGAG TACA.GATGAGGC TAAT ACT TAC T TCAAGGAATGGACCTGT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGC GATC GCTGG TC CAGTCAGC AGGGCAGCAAAGC AGAC TACC CAG AG GGGGACGGCAAC T GGGC 
H CAAC T AC AACAC CT T TGGATC T GC AGAGGCGGCCACGAGCGAT GAC TACAAGAACC C T GGCTAC T 

n 

r Zl ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 
yj G T T T GG C AT CTACC AGAAAT AT CC AGT GAAATATGGAGAAGGAAAG TG T T GG AC T GACAAC GG CC 

Lll CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
^1 GGCCAGCGGGAAT TCACTGCGGGAT TTGTTCAG TTCAGGGT AT TT AATAACGAGAGAGCAGCCAA 

CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 
%\ GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT 
\V\ GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
CI TTGAGAGT T T T GTGGGAGGGAACCCAGAC CTCTCCT CCCAACCATGAGATCCCAAGGATGGAGAA 

CAAC T TAC C CAG T AGC T AG AAT G T T AAT GGC AGAAG AG AAAAC AAT AAAT CAT AT T GAC T CAAGA 
W AAAAAA 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQT FC DMT S GGGGW TLVASVHENDMRGKC TVGDRWS S QQGSKADYPE GDGNWANYNT FGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWTDNGPVI PWYDFGDAQKTAS YYS PYGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 



AppJD=10063581 



Page 240 



FIGURE 8q 

C TAGATT TGTCGGC T TGC GGGGAGACT TCAGGAGTCGCT GT CT CTGAACT TCCAGCCT CAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGGCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
N TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGGTGAAAAAGAAACCCTTCTGATTACCTTCA 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCT TCGTATTCC TGGAAGAAGGAAG 
Q) GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
Lij TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
J1 T TAAGAC T TATATACAGT T T TAGGGGACAAT TAAAAAAAAAAAA • 

yj 

11 • 
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FIGURE qo 



MGRVSGLVPSRFLTLLAHLVWI TLFWS RDSN I QACLPLT FT PEE YDKQDI QLVAALSVT LGL FA 
VEIAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMALFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE Qi 



CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAG ATGC AACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTGTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
CCAAC ACAATGC CACAGGCC AGGGAAACAT C TC CATC AGCC TC GT GCCCCCCAGTAAAGCTGTAG 
AGT TCCACCAGGAACAGCAGAT CTTCATCGAAGCCAAGGCCTCCAA?\ATC T TCAACTGCCGGATG 
GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
TCTACATCGCCTTCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 
CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 
AGAAAGGG T CCCAAGTGC TGGT CCCAACC TGAAGCTGTGGAGT GACTAGATCACAGGAGCAC TGG 
AGGAGGAGTGGGCTCTCTGTGCAGCCTC^GAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 
TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
CCTTGGTT C TTGCCATCC TGAGGAAAGAT AGCAACAGGGAGGGGGAGATT TCAT CAGT GT GGACA 
GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCATCC T TAAGCTAAGAC AGGAC GAT TGTGGTCC T CCCACAC TAAGGCCACAGCCCATC 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGGA 
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FIGURE Q2 

MQLTRCC FVFLVQGSL YLVI CGQDDGPPG S EDPERDDHE GQPRPRVPRKRGH I S PKSRPMANS TL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNIS ISLVPPSKAVEFHQEQQI FIEAKASKI FNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 

Signal peptide: 
amino acids 1-14 

Q\ N-glycosylation sites. 

LlJ amino acids 62-65, 127-130, 137-140, 143-146 

1 sr. 

■Ln 
pi 

f 2-oxo acid dehydrogenases acyl transferase 

M 

^ amino acids 61-71 

0 
Ui 

;»? ;: 

WiS 

Uj 

m 
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FIGURE Q3 

C GGTGGCC ATGA CTGCGGCCGTG TT CT TC GGCTGCGCCT TCAT TGCCT TCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
C TGG TTGG TG TC TC TACTGAT T TCG T CCC T TGT T TGGTT CATGGCAAGAG T CATTAT T GACAACA 
AAGAT GGACCAACACAGAAATATCT GCT GATC T TTGGAGCGTT TGT C TC TGTCTAT AT CCAAGAA 
ATGTTCCGAT T TGCATATTATAAACT CT TAAAAAAAGCCAGTGAAGGTTTGAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCC TGAAAC TCTGCCTGC TC TGCCAAGACAAGAACTT TC T T CT T TACAACCAGCGC TCCAGAT 
AAC CT CAGGGAACCAGCACTTCCCAAACCGCAGAC TACATCT T TAGAGG7VAGCACAAC T GT GCC T 
T TT TC TGAAAATCCC T T T T TCT GGT GGAAT TGAGAAAGAAATAAAAC TATGCAGAT A 
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FIGURE Q4 

MTAAVFFGCAFIAFGPALALYVFT I AI EPLRI I FL IAGAFFWLVSLL I SS LVWFMARVI I DNKDG 
PTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGFGIMSGV 
FSFTOTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 
amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium: neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE 

AATT TTT CACCAGAGTAAACT T GAGAAACCAAC TGGACCT T GAG TAT TGTACATT TTGCCTCG TG 
GACCC AAAGGTAGCAATCTGAAAC ATGA GGAGTACGATT C TAC TGTT T TGTC TTC TAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAAC AC TAC CAAAC C AAC AG C AG T C AAAT C AGG T CTTTCCTTCTT T AAGT C T G AT ACC AT T AAC A 
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGTAC 
CCAGACC CACCCAT TGACCCTGGGAGGGT TGAATGTACAACAGCAACTGCACCCACAT G TG TTAC 
CAATTTTTGTCACACAACTTGGAGCCCAGGGCACTATCCTAAGCTCAGAGGAATTGCCACAAATC 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
GAACCCCAGCAGGC CGCC TCCCAACT CCC AGTGGCACAGAT GACGAC T TT GCAGT GACCACCCCT 
GCAGGCATCCAAAGGAGCACAC AT GCCAT CGAGGAAGCC AC CACAGAATCAGCAAATGGAAT TCA 
G TAAG CTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
AT T GAT TAT AT TAT GGAATAGAT T G AGAC AC AT T G GATAGT CT T AGAAGAAAT TAAT T C T TAAT T 
TAC C T G AAAAT AT T C T T GAAAT T T C AG AA AAT A TGTTCTATG T AGAGAAT C C C AAC TTT T AAAAA 
CAATAATTCAATGGATAAATCT GT CT TTGAAATATAACATTATGCTGCCTGGAT GATATGCATAT 
TAAAACATAT TTGGAAAACT GGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE Q6 



MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQI FTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGWPATQGTPAGRLPTPSG 
TDDD FAVTT PAG I QRS THAIEEATTES ANG I Q 

Signal peptide: 

amino acids 1-16 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGT TCCAGAAAACTATGGTGGAAATTTCCCTT TATACCTGACCAAG T TGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 
CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
f% ACCAGCCT TGAC CACGCCC TGGAGAGGACC TACCAGC TGT T GGTACAGGTCAAGGACATGGGT.GA 

% CCAGGCCTCAGGCCACCAGGCCACTGCCA.CCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
^ CCCTAGAGCCTATCCACCTGGCAGAGAAT CTCAAAG T CCTATACCCGCACCACATGGCCCAGGTA 

CP . CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
yj TGCAGAGGGAAACCTCTACGTGACCAGAGAGCTGGACAGAGAAGCCCAGGCTGAGTACCTGCTCC 
') f | AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 

ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
CAGT CCACCAGGTACTGAAGTGAC TAGAC TGTCAGCAGAGGAT GCAGAT GCCCCCGGC TC CCCCA 
K s ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
g CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
H CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
i« TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT 
ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGGCATGCTAACAGCCATTGATGCTGA 
Q CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACAGAGAAGGGACTTTTG 
Uj GCCTGGAT TGGGAGCCAGACT C TGGGCAT GT TAGAC T CAGACT C TGCAAGAACCT CAGT T ATGAG 

f\ GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
%\ AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
^ ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCA(^CCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCA.CCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCA.CAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
ATTTTCACCCACTGGACCATGTCAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGTGCCCCT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
T CC CC TGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGC CCC TCCA 
TCT GCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCC TGGACACCAACT T 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE q8 

MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
STSLDHALERTYQLLVQVKDMGDQASGHQATATVEVS I IESTWVSLEPIHLAENLKVLYPHHMAQ 
VHWSGGDVHYHLESHP PGP FEVNAEGNL YVTRELDREAQAE YLLQVRAQNSHGE DYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEWWVQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQESYEASVPI SAPAGS FLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFS GEVHTAQSLQGAQPGDTYTVLVEAQDTA1TLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REHI I P WVSHNAQMWQL LTOVI VCRCNVEGQCMRKVGRMKGMPTKLSAVG I LVGT LVAI G I FLI 
LI FTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE QQ 



GGCTGACCGTGC TACAT TGCCT GGAGGAAGCC TAAGGAACCCAGGCATCCAGC T GCCCACGCCTG 
AGT CCAAGAT TC TTCCCAGGAACACAAACGTAGGAGACCCACGC TCCTGGAAGCACCAGCC TTTA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATCTAGGACCCAGGCATC T T GCT TTCCAGCCACAAAGAGACAGATGAAGAT GCAGAAAGGAAATG 
T TC TCCT TATGT TTGG TC TAC TAT TGCAT T TAGAAGC TGCAACAAAT TCCAATGAGAC TAGCACC 
T CT GCCAACACTGGATCCAGTGTGAT CTC CAG T GGAGCCAGCACAGCCACCAAC TCT GGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAG T CACCAAC TCT GAGT T CCATACAACCT CCAGTGGGAT CAGCACAGCCACCAACT C T GAG 
TTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 
GGCCAGCACAGCCACCAACTCT GAGTCCAGCACACCCTCCAGT GGGGCCAGCACAGTCACCAACT 
CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
~, AGTAGGGCCAGCACTGCCACCAACTCTGAGTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCAC 
5{ CAACT CT GACT CCAGCACAACCTC CAG TGGGGC TAGCACAGCCACCAACTC TGAGTCCAGCACAA 

M CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
pi GCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAG 
\jl AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
jfl GCACAGCCACCAACTCTGACTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAG 
%l TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
'41 G GC TAGCACAGCCAC CAACT C T GAC T CCAGCACAACC TCCAGT G GGGCCGGCACAGC CAC CAACT 

M CTGAGTC CAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAAT TC TGAGTCCAGCACACCCTCC 

g AGTGGGGCCAACACAGCCACCAACTCTGAGTCCAGTACGACCTCCAGTGGGGCCAACACAGCCAC 
r ff\ CAAC T CT GAGTCCAGCACAGT GTC CAGT GGGGC CAGCAC T GCCACCAAC T C TGAG TCCAGCACAA 

fjj CCT CCAGTGGGG TCAGCACAGCCACC AAC TCT GAG T CCAGCACAACC TCCAGT GGGGCTAGCACA 

U i GCCACCAACTCTGACTCCAGCACAACCTCCAGTGAGGCCAGCACAGCCACCAACTCTGAGTCTAG 
CI CACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACAACCTCCAGTGGGGCCA 
U j ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
j£I ATGCACACAAC TTCCCATAGT GCATC TAC TGCAGTGAGT GAGGCAAAGCC TGGT GGG T CCCTGGT 

JJ" GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
I - TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GTGGAGTCCTAACTGGTTCTGGAGGAGACCAGTATCATCGATAGCCATGGAGATGAGCGGGAGGA 
ACAGCGGGCCCTGAGCAGCCCCGGAAGCA^GTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AATCTTGAAGAAGGTATTCCTCACCTTTC TTGCCT TTACCAGACACTGGAAAGAGAATACTATAT 
TGCTCATT TAGCTAAGAAAT AAAT ACATC TCATCTAACACACACGACAAAGAGAAGCT GT GCTTG 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CAT C TGGCAT TCAAAATC TCCACAG T AAAATCCAAAGACCTCAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAA?W^AAAAAAAAAAAAAAAAA?VA 
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FIGURE 100 

MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSSSSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 
AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 
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FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGM.TTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATGAGGCTGAAAAACA.GAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGAT GAT GT GATATATGCAT TC CAGGGGAAGGGAAAT TGT GGT GC TT C TGAACC CATGG T CAAT T 
AAC GAGGCAGT T TC TAGC TACT GCACGTACTTCATAAAGCAGGAC TC TAAAAGCT T T GGAAT CAT 
GGT G T CAT GGAAAGGGAT T TACT TTATAC TGAC TCT GT T T TGGGGAAGCT T T TT TGGAAGCATT T 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
^ GATTATAACTGGGGATGCATTTGTTCCTGGAGAAAGAAGTGTCATTATCATGAACCATCGGACAA 
%i GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 
fjj TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
lj| CT T CATT CATAGGAAATGGAAGGATGACAAGAGCCAT TTCGAAGACATGAT TGAT TACT TT TGT G 

01 AT AT T CACGAAC C AC T T C AAC T C C T CAT A. T T C C C AG AAG G GAC T GAT C T C AC AG AAAAC AG CAAG 

TC T CGAAG T AATGCATT TGCTGAAAAAAAT GGACT T C AGAAATATGAATATGTT T T AC AT C CAAG 

3 

AAC TACAGGC T T TAC T T T T GT GGTAGACC GTC TAAGAGAAGGTAAGAACC TT GAT GC T G TCCAT G 
jjj ATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 
Q CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 
LlJ CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 
f'\ GGGAGAAGAATT TT TATT T TAC CGGACAGAGTG TCAT TCCACCT TGCAAG TC T GAACT CAGGGTC 

CTTGTGGTCAAATTGCTCTCTATACTGTA.TTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATAT TTGTACAG TC T TGT TAAGTGGTAT TT TATAATCACCATT GTAATCT T TG T GCTGCAAGAGA 
GAATATT TGGTGGACTGGAGAT CATAGAACTTGCATGTTACCGACTT TTACACAAACAGCCACAT 
TTAAATT CAAAGAAAAATGAGTAAGATTATAAGGTT T GCCATGTGAAAACCTAGAGCATATTTTG 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCCATCAT TAT TTG T TAAAGATAT T TTGCAC TTAATT TT GT GGGAAAAATATTGCT AC AAT T T T T 
TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 
GGGCCAGAATAT TAT TAAACAATCAT CAGGC T T TTAAA 
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FIGURE 102 



MHSRGREIVVLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDWM 
FLWNCIiMRYSYLRLEKICLKASLKGVPGFGWAM^^ 

LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFT FWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVI PPCKSELRVL WKLLS I LYWTL FS PAMCLLI YLYS LVKWYFI I T I VI FVLQERI FGG 
LEIIE LAC YRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 ■ 
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FIGURE 1Q3 

CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAATCATCCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAACCCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAS.GTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CAC C T C TACAGAGAT GGGGAAGACT GGGAATCTAAGC AGAT GC CACAGTATCGAGGGAGAAC T GA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
AGGATTTGTCTTCAGACTCCAGAGCAAATGCAGATGGGTACAGCCTGTATGATGTGGAGATCTCC 
ATT AT AGTCCAGGAAAAT GC T GGGAGCATATT G TG T T CCAT CCACC T TGC TGAGCAGAG TCAT GA 
GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
AAAT C GAAAGGGAAAAT CCAGGCGGAAC TGGAC TGGAGAAGAAAGCACGGACAGGCAGAAT TGAG 
AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
T T T C T GAT C T GAAAAC T G T AAC C C AT AG£ AAAG C T C C C C AG G AG G T GC C T CACT C T GAGAAGAGA 
TTTACAAGGAAGAGTGTGGTGGCTTCTCA.GGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGA 
CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC 
ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
TGACATGTCAGT TTGAAGGCT T GTTGAGACCCTATAT CCAGCATGCGATGTATGACGAGGAAAAG 
GGGACTCCCATATTCATATGTCCAGTGTCCTGGGGATGAGACAGAGAAGACCCTGCTTAAAGGGC 
C CCACACCACAGAC CCAGACACAGCCAAG GGAGAG T GCT CC CGACAGGTGGCCCCAGC TT CCTCT 
CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
TCAGAAGCCAT GGC TGCCCTGAAGTGGGGACGGAATAGAC TCACAT TAGGT T TAGTT T GT GAAAA 
CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 
ACAG T GAT T CCTGCCTCACAGGT GAAGAT TAAAGAGACAACGAATG TGAATCAT GCT TGCAGGT T 
TGAGGGCACAGTGTT TGC TAATGATGTGTT TT TATAT TATACAT TT TCCCACCATAAACTCTGTT 
TGCTTATTCCACATTAATT TACTTTTCTCTATACCAAAT CACCCATGGAATAGT TAT T GAACACC 
T GCT TT GTGAGGCTCAAAGAATAAAGAGGAGGTAGGATT T T TCACTGAT TCTATAAGCCCAGCAT 
TAC CT GATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAAC TACAGGTC CATATCC 
C TCAT TAACACAGACACAAAAAT T C TAAATAAAAT T T TAAC AAAT TAAAC TAAACAAT AT AT TTA 
AAGATGATATATAACTAC TCAGTGTGG T T TG T CCCAC AAATGCAGAGTT GGTT TAATAT T TAAAT 
AT CAACC AGT GT AAT TCAGC AC AT T AATAAAGTAAAAAAGAAAACCAT AAAAAAAAAAAAAAA 
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FIGURE 10 J. 

MAFVL I LVLS F YELVS GQWQVTGPGKFVQALVGEDAVFS C S L FPE T SAEAMEVRFFRNQ FHAWH 
LYRDGEDWE SKQMPQYRGRTE FVKDS I AGGRVS LRLKNI TPSDI GLYGCWFS SQ I YDEEATWELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
I VQENAGS I LCS IHLAEQSHEVESKVLIGETFFQPS PWRLAS I LLGLLCGALCGWMGMI I VFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKS WAS QG FQAGRH YWE VDVGQNVGW YVGVCRDDVDRGKN NVTL S PNNGYWVLRL T TEHLYFT 
FNPHFI SLPPSTPPTRVGVFLDYEGGT I S FFNTNDQSL I YTLLTCQFEGLLRPYI QHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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CCTTCACAGGACTCTTCATTGCTGGTTGGC AiVTGA TGTATCGGCCAGATGTGGTGAGGGCTAGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CTCACTGTTCATTATGTGAGATATAATCAAAAGAAGACCTACAATTACTATAGCACATTGTCATTTACAAC 
T G AC AAACT AT ATGCTGAGT TT GGCAGAGAG GCT TCTAAC AATTTT ACAG AAAT GAGCCAGAGACTTGAAT 
CAAT GGTG AAAAAT GCATTT TATAAATCTC CATT AAGGGAAGAAT TTGTC AAGT CTCAGGTT AT CAAGTT C 
AGTC AAC AGAAGCATGGAGT GT TGGCTC AT AT GCT GTT GATT TGT AGATT T CACTCT ACTGAGGATCCTG A 
AACT GT AG AT AAAATTGT TC AACT TGTT T T ACAT GAAAAGCT GC AAGATGCT GT AGGACCCC C T AAAGT AG 
, AT CCTC ACT C AGTT AAAATT AAAAAAAT CAACAAG ACAGAAACAGACAGCT ATC T AAACCAT T GCTGCGG A 

'f* \ AC AC GAAG AAGTAAAACTCT AGGT C AGAGT CT CAGGATCGTT GGTGGGAC AGAAGT AGAAGAGGGTGAATG 

r ; i GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
ff\ TT GT GAGT G C TG CT CACTGT T T TAC AAC AT A 1 ]? AAGAACCCTGCC AG AT GG AC TGCT TCCTTT GGAGTAACA 

yj AT AAAACCT T CGAAAATG AAAC GGGGT CTCCGGAGAAT AAT T GT CC AT GAAAAAT AC AAACACCC ATC ACA 

!J f T G AC TAT GAT AT T T CT CT T GCAGAGC T TTCTAGCC C TGTTCC CT AC AC AAAT GC AG TAC ATAGAGT T TGTC 

TCCCTGATGCATCCTATGAGTTTCAACCAGGTGATGTGATGTTTGTGACAGGATTTGGAGCACTGAAAAAT 
r ' ??? GATGGTTACAGT CAAAATCATCTT CGACAAGCACAGGTGACT CTC AT AGACGCTAC AACTTGCAATGAACC 

asi TC AAGC TT ACAATGACGCCAT AACTC CT AG AAT GTT ATGTGCTGGCTC CT T AGAAGGAAAAACAG AT GC AT 

J f * ' GCCAGGGTGACTCTGGAGGACCACTGGTTAGTTCAGATGCTAGAGATATCTGGTACCTTGCTGGAATAGTG 

m 

*»* AGCT GG GG AGAT GAATGTGCGAAACCCAAC AAGCCTGGTGTT T AT ACTAGAGT T ACGGCCTT GCGGGACT G 

M 

\ z \ GATTACTTCAAAAACTGGTATCTAAGAGACA?iAAGCCTCATGGAACAGATAACATTTTTTTTTGTTTTTTG 
g;| GGT GTGGAGGCC AT TTTT AGAGAT ACAGAATT GGAGAAGACT TGC AAAAC AGCT AG ATTTGACT GAT CTC A 

f|| ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GATC AACTCT GT CATCTGTGAGCAAT AGT TGAAACTTT ATGTAC AT AG AG AAAT AG AT AATACAAT ATT AC 
AT T ACAGCCTGTAT TCATTT GTTCTCTAGAAGrTT T TGT CAGAAT T TTG AC TT GT T G ACAT AAAT TT GT AAT 
GCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTCAGCTCCTCTCATTTCAGCAAATATCCATTT 
TCAAGGTGCAGAACAAGGAGTGAAAGAAAATATAAGAAGAAAAAAATCCCCTACATTTTATTGGCACAGAA 
AAGTATTAGGTGTTTTTCTTAGTGGAATATTAGAAATGATCATATTCATTATGAAAGGTCAAGCAAAGACA 
GCAGAATACCAATCACTTCATCATTTAGGAAG^TATGGGAACTAAGTTAAGGAAGTCCAGAAAGAAGCCAAG 
AT AT AT CCTT AT TT TC AT TT CC AAAC AACT AC TAT GAT AAAT GT GAAG AAGAT T CTGT TTTT TT GTGACCT 
AT AAT AATTAT ACAAACT TCATGCAATGT ACT' TGTTCT AAGCAAATTAAAGC AAAT AT TT AT TT AAC ATTG 
TTACTGAGGATGTCAACATATAACAATAAAATATAAATCACCCA 
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FIGURE 106 

MWRPDVVRARKRVCWEPWIGLVIFISLIVLAV^ 

AEFGRSASNNFTEMSQRLESMVKNAFYKSPLREEFVKSQVIKFSQQKHGVLMMLLICRFHSTE 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TE VEEGEWPWQASLQWDGSHRCGATL INATWLVSAAHC FT T YKNPARWTAS FGVT I KPSKMKRGL 
RR 1 1 VHE K YKH P S HD YD I S L AE L S S P VP Y T NAVHRVC L P DAS YE FQ P G DVM FVT G FGALKNDG Y S 
QNHLRQAQVTLIDATTCNEPQAYNDAITPRMLCAGSLEGKTDACQGDSGGPLVSSDARDIWYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCT CCGGCTC TCCGCGAAGAAGT TCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGAT CGC TGCCGTTT TGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCT CCC T T 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAATGTTCTCACTTTAACTGCAGAAACGAGGGTAGAGGAAGCAGTCATTTTGACTTACTTTCCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GAT AT GGTCAC T T TGAAAGCCAGGAT GACAAAT TATGGAT TACCTAGATATCGGT GGC T TACTCA 
y± TGCTTGGAATTTTTTTCAGAGAGAGTTT^AGTGCTGTGGAGTAGTATATTTCACTGACTGGTTGG 
f\ AAATGAC AGAGATGGACT GGCC CC CAGAT TCC TGCTGTGT TAGAGAAT TC CCAGGAT GT TCCAAA 

%t CAGGC CCAC CAGGAAGAT CTCAGTGACC T T TATCAAGAGGGT T GT GGGAAGAAAAT G TATTCC T T 

y TTTGAGAGGAACCAAACAACTGCAGGTGCTGAGGTTTCTGGGAATCTCCATTGGGGTGACACAAA 
Ul TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
Uj ACAGACCAAATGATGTCCTTGAAGAATGACAACTCTCAGCACCTGTCATGTCCCTCAGTAGAACT 
|f| G T T GAAACC AAGC C T G T C AAGAAT CT TTG AAC ACAC AT C CAT GGCAAACAGC T T TAAT AC AC AC T 

T T GAGAT G GAGGAG T TATAAAAAGAAATGTCACAGAAGAAAAC CACAAAC T T GT T T TAT T GGAC T 
T G T GAAT T T T TGAGTACATAC TAT G T GT T T CAGAAAT AT G T AGAAATAAAAATGT T GC CATAAAA 
N T AAC AC C T AAGC AT ATAC TAT T C TAT G C T T TAAAATG AGG ATGGAAAAGT T T CATGTC AT AAG T C 

is ACCAC CT GGACAATAAT T GAT GC CC T TAAAATGC TGAAGAC AGATGTCATAC CC ACTGTGTAGCC 

f'S TGT GTAT GAC T T TTAC T GAACACAG T TAT GT T T TGAGGCAGCATGGT T TGAT TAGC AT T T CCGCA 

\u TCCATGCAAACGAGTCACATATGGTGGGACTGGAGCCATAGTAAAGGTTGATTTACTTCTACCAA 
t!f C TAGTATATAAAGTAC TAAT TAAATGC TAACATAGGAAGTT AGAAAATAC TAAT AAC T T T TATTA 

P C TCAGCGATCTATTC T TC TGAT G CTAAATAAAT TATATAT CAGAAAAC T T TCAATAT T GGTGAC T 

Lf| ACCTAAATGTGATTTTTGCTGGTTACTAAAATATTCTTACCACTTAAAAGAGCAAGCTAACACAT 
Hj TGT CT TAAGCTGATCAGGGAT T T T T T G TATATAAGT C TGTGT TAAATC TGTATAAT TCAGTCGAT 

%\ TTCAGTT CT GATAAT G TTAAGAAT AACCAT TAT GAAAAGGAAAATT TGTCC TGTAT AGCATCAT T 

I -W ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 
TGT TATTTAAATAC T TAACCACTAATTT TGAAAATTACCAGTGTGATACATAGGAATCATTATTC 
AGAAT G TAGTC TGGTC T T TAGGAAGT AT TAATAAGAAAAT T T GCACATAAC T TAG TT GATTCAGA 
AAGGACT TGTATGC TGT T T TTCT CCCAAA.TGAAGAC T CT T T T T GACAC TAAACAC TT T T TAAAAA 
GCTTATCTTTGCCTTCTCCAAACAAGAAGCAATAGTCTCCAAGTCAATATAAATTCTAC^ 
TAGTGTT CT T T T TC TCCAGAAAAATGCT T GTGAGAAT CAT T AAAACATGTGACAATT TAGAGAT T 
C TT TGTTTTAT TTCACTGATTAATATACTGTGGCAAATTACACAGATTAT TAAAT TTTTTTACAA 
GAGTATAG TAT AT T TAT T T GAAAT GG GAA AAG T GCAT T T TAC T G TAT T T T GT G TAT T T T GT T TAT 
TTCTCAGAATATGGAAAGAAAAT TAAAATGTGTCAATAAATAT TTTCTAGAGAGTAA ' 
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MARE DS VKC LRCLL YALNLL FWLMS I S VL AVSAWMRDYLNNVL TLTAETRVEEAVI LT Y FPWHP 
VMIAVCCFL I IVGMLGYCGTVKRNLLLLAWYFGSLLVI FCVELACGVWTYEQEIMVPVQWSDMVT 
LKARMTNYGLPRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
EDLS DLYQEGCGKKMYS FLRGTKQLQVLRFLG I S I GVTQ I LAMI LT I TLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

'r e * amino acids 1-33 

Wi ■ • 

Transmembrane domains: 




amino acids 12-35, 57-86, 94-114, 226-248 
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ul 

■ \V\ 



CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GATAAAGTCTTTCTCGTGAGAGCCTAGAGGCCTTAAAAAAAAAAGTGCTTGAAAGAGAAGGGGACAAAGGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCT GGAGGAACACTGACCACCAGTT GGATGAGTCTCAAGGTCCTCCTCTATGTGACAACCATGT GAAT GGGGAG 
TGGTACCACTTCACGGGCATGGCGGGAGATGCCATGCCTACCTTCTGCATACCAGAAAACCACTGTGGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
CT GACCAAGCCCAGCGT CTGCTTCCAC GT CTACTGTGGT C ATTT TT ATGACATCTGCGAC GAGGACTGCCAT GG 
CAGCTGCTCAGATACCAGCGAGTGCACATGCGCTCCAGGAACTGTGCTAGGCCCTGACAGGCAGACATGCTTTG 
ATGAAAAT GAATGT GAGC AAAACAACGGT GGCT GCAGTGAGATCTGT GT GAACCTC AAAAACTCCT AC CGCTGT 
GAGT GT G GG GT T GGC C GT GT GC T AAGAAGT G AT GG C AAGACT T GT GAAGAC G TT GAAGGAT GC C AC AAT AAC AA 
TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
CTGAGGATAACCACACTTGCCAAGTCCCTGTGTTGTGCAAATCAAATGCCATTGAAGTGAACATCCCCAGGGAG 
CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
CCTCTTCTCTCTCAAGACATGTGGTACAGTGGTCGATGTGGTGAATGACAAGATTGTGGCCAGCAACCTCGTGA 
CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
ACCTGC GAGT TTCCAC GCCT GTACACCAT TTCTGAAGGATACGT TCCCAACC TT CGAAACTCCCCACT GGAAAT 
CATGAGGCGAAATCATGGGATCTTCCCATTCACTCTGGAGATCTTCAAGGACAATGAGTTTGAAGAGCCTTACC 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGT GAGC GGC 
TTGGAAAGCTTGGTGGAGAGCTGCTTTGCCACCCCCACCTCCAAGATCGACGAGGTCCTGAAATACTACCTCAT 
CCGGGATGGCTGTGTTTCAGATGACTCGGTAAAGCAGTACACATCCCGGGATCACCTAGCAAAGCACTTCCAGG 
TCC CT GT CTT C AAGT TT GT GGGC AAAGACCAC AAGGAAGT GT T T CT GCACT GCC GGGT TC TT GT CT GT GGAGT G 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGTCT AC AGGGCCAGAC GCTAACAGGC GGCC CGATC CGC AT C GACT GGGAG GACTAGT T CGT AGCC AT AC CT C 

GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTC AGACTT CACACT GTGAGTTCAGACTCCCAGCACCAACTCACT CTGATTC TGGT CCATTCAGTGGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
CACAAAATCAGAAGCTGGGTATAATAT TTCAAGTTACAAACCCTAGAAAAATTAAACAGTTACT GAAATTATGA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
TTATAGGGAATTT GGAAGTGTATCAAT AAAACAGTATAT AATT T T 
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MPP FLLLTCL FI TGT S VS PVALDFCS AY I SLNEPWRNT DHQLDE SQGPPLCDNH VNGEWY HFTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVIiGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV' 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCEC PRGLVLSE DNHTCQVPVLCKSNAI EVNI PRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDVVNDKIVASNLVTGLPKQTPGSSGDFIIRTSECLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLYFGIEPWHV 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LV CG VL DE RS RCAQ GC HR RMRRGAGG ED S AGIj QGQTLT GGP I RI DWE D 

Important features of the protein: 
Signal peptide : 

amino acids 1-16 

N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 

N-myristoylation sites. 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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GAGAGAGGCAGC AGCT TGCT CAGCGG AC AAGGAT GCTGGGCGTGAGGGACCAAGGCCTGCCCTGC ACTCGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTGCT GCCT TGGGGT GACAATCT C AGCTCCAGGCT ACAGGGAGACCGGG AGGATCAC AG AGCCAGC ATGT 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 
AT GGAG ACCT T C AGAAAGGT GGGGATCCCC AT CATC AT AGCACT ACTG AGCCTGGCGAGT AT CAT C ATTGT 
GGT T GTCCTC AT CAAGGTGATT CT GG AT AAAT ACT ACTTCCT CT GCGGGCAGCCTCTCCACT TCATCCCGA 
GGAAGC AGCTGT GT GACGGAGAGC TGGACT GT CCCTTGGGGGAGGACG AGGAGC AC TGTGTC AAGAGCTTC 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
AGGGAACTGGTT CT CTGCCTGTTT CGACAACTTCAC AGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
GC TACAGC AGAG CT GTGG AGAT T GGC CC AG AC' CAGG AT CTGGAT GTTGTT GAAATC AC AG AAAAC AGCCAG 
GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
TCAGCATCCAGTACGACAAACAGCACGTCTGTGGAGGGAGCATCCTGGACCCCCACTGGGTCCTCACGGCA 
GCCCACTGCTTCAGGAAACATACCGATGTGTTCAACTGGAAGGTGCGGGCAGGCTCAGACAAACTGGGCAG 
CTTCCCATCCCTGGCTGTGGCCAAGATCATCATCATTGAATTCAACCCCATGTACCCCAAAGACAATGACA 
TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
GATGAGGAGCTCACTCCAGCCACCCCACTCTGGATCATTGGATGGGGCTTTACGAAGCAGAATGGAGGGAA 
GATGTC TGAC AT AC TGCT GC AGGC GT CAGT CC AGGTC ATTGACAGC AC ACGGTGCAATGC AGACGATGCGT 
ACCAGGGGGAAGTCACCGAGAAGATGATGTGTGCAGGCATCCCGGAAGGGGGTGTGGACACCTGCCAGGGT 
GACAGTGGTGGGCCCCTGATGTACCAATCTGSCCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 
CTGCGGGGGCCCGAGCACCCCAGGAGTATACACCAAGGTCTCAGCCTATCTCAACTGGATCTACAATGTCT 
GGAAGGCTGAGCTGTAATGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGATCCCCCAAAGTCAGACACAGAGC AAGAGT CCCCTT GGGT AC ACCCCT CT GCCC ACAGCC TCAGCAT 
TTCTTGGAGCAGCAAAGGGCCTCAATTCCTGTAAGAGACCCTCGCAGCCCAGAGGCGCCCAGAGGAAGTCA 
GCAGCCCTAGCTCGGCCACACTTGGTGCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCT 
CAGGGGTATT GCT AAGCC AAGAAG GAACTTT C CC AC ACT ACT GAATGGAAGCAGGCTGTCT T GT AAAAGCC 
CAGAT C ACTGTGGGCTGG AGAGGAGAAGGAAA GGGTCT GCGCCAGC CCTGTCCGTCTT CACCCAT CCCCAA 
GCCTACTAGAGCAAGAAACCAGTTGTAATATAAAATGCACTGCCCTACTGTTGGTATGACTACCGTTACCT 
ACTGTTGTCAT T GT T ATT ACAGCT ATGGCC ACT AT T ATTAAAGAGCTGTGTAAC AT CTCTGGCAAAAAAAA 
AAAA 
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mlqdpdsdqplnsldvkplrkpripmetfrkvgipi iiallslasi iivwlikvildkyyflcg 
qplhfiprkqlcdgeldcplgedeehcvk:sfpegpavavrlskdrstlqvxdsatgnwfsacfdn 
ftealaetacrqmgysrave i gpdqdldvve i tens qelrmrns sgpcls gs lvslhclacgksl 
kt prwggeeas vds w pwqvs i q ydkqhvcggs i ldphwvltaahc frkhtdvfnwkvrags dkl 

GS FPSLAVAKI 1 1 IEFNPMYPKDNTDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWI IGWG 
FTKQNGGKMS D I LLQAS VQVI DS T RCNADDAYQGE VT EKMMCAG I PE GGVDTCQGDS GG PLMYQS 
DQWHWG I VS WGYGCGGP S T PGVYTKVS AYLNW I YNVWKAE L 

Transmembrane domain: 

amino acids 32-53 (typell) 
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GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGC T AGGAT T C TGACTATGCT GTG GTGGCTAGTGCTCCTAC TCCTACC TACAT TAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
AT C CAT GGGC T AAT C T GAAC T G T GT C CCA&GGAACC C AGAGC T T GAGT GAGC T G T GGCTC AGACG 
C AG AAGGGGTC T GC T TAGAC CACC TGGT T TAT G T GACAGGAC T T GC AT T C T CC T GGAACAT GAGG 
GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAACT TGT GCCAAAT TATGGG TCAGAAAAGAT G 
GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
C TC CT TC CCTC TGAGAGGCCCT CCTATGTCCC TAC TAAAGCCACCAGCAAGACATAGCTGACAGG 
GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
CTCAATTTAAATCATGTTCTAGTAATTGGkGCTGTCCCCAAGACCAAAGGAGCTAGAGCTTGGTT 
CAAATGATCTCCAAGGGCCC T TAT ACCCCAGGAGACTT T GAT T TGAAT T T GAAACC CCAAATCCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAA Z \AAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGCTACTCGGGAGGCTGAGACAGGAGA^TTACTTGAACCTGGGAGGTGAAGGAGGCTGAGACA 
GGAGAATCAC T TCAGCC TGAGCAACACAGCGAGAC T CTGTCT CAGAAAAAATAAAAAAAGAAT TA 
TGGTTATTTGTAA 
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FIGURE lid 



MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 



Signal peptide: 

amino acids 1-15 



m 
m 
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FIGURE 11s 



u 
hi 



CAGC^TGGTCTCTC^GTCCTCTCAAAGCAAGGAAAGAGTACTGTGT 

AGAAT CC TCCAGAGAATT G TGAAGACTGTCACAT TCTAAAT GCAGAAGCT T TTAAAT CCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
TCTACAGCAATGGAGAGAAGAAGAAGAT TTACATGGAAAT TGATCC TGTGACCAGAACTGAAATA 
T TCAGAAGCGGAAAT GGCACT GAT GAAACATTGGAAGTGCACGACT T TAAAAACGGAT ACACTGG 
CATCTAGTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
CTGAACCAGAAGAGGAAATAGATGAGAATGAAGAAAT TACCACAACTTTCT TTGAACAGTCAGTG 
ATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGATTTTCTTAAAAATTCCAAAATTCTGGA 
GAT T T GT GATAACGTGACCATGTAT T GGATCAATCCCACTCTAATATCAGT TTC TGAGT T ACAAG 
AC T T T GAG G AGG AG GG AGAAG AT C T T CAC TTTCCTGC CAAC G AAAAAAAAGG GAT T GAAC AAAAT 
GAACAGTGGGTGGTCCCTCAAGTGAAAGTAGAGAAGACCCGTCACGCCAGACAAGCAAGTGAGGA 
AGAACTT CCAATAAATGACTATACTGAAAATGGAATAGAAT TT GATCCCATGCT GGATGAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
TAACTGGTGGGTGGCCCGGATGCTGGGGAGGGT CTAA TAGGAGGTTTGAGCTCAAATGCTTAAAC 
TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
T TAT CAC CAAAAAAAAAAAAAAAAAAA 
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FIGURE 116 



MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HT FYSNGEKKK I YME I DPVTRTE I FRS GNGTDETLEVHDFKNGYTG I YFVGLQKCFIKTQIKVTP 
EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPTLISVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCC GGCT GC AT CGC GGCCACCGGGATGGACATGTG GAGC ACCCAGGACCT GTACGAC AACC CCGTC ACCT 
CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 
TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
TCT GT GTTT GCC AACATGCT GGTGAC T AACTT CT GG ATGT CC AC AGCT AACAT GT AC AC CGGCAT GGGTG G 
GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
'%t TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 

C-J 

GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
j . ; TGGGT CC AACACCAAAAACAAGAAGAT AT ACG AT GG AGGTGC CC GC AC AG AG GACG AGGT AC AAT CT T AT C 

;f| CTTCCAAGCACGACTATGTGTAATGCTCTAAGACCTCTCAGCACGGGCGGAAGAAACTCCCGGAGAGCTCA 
rff CCCAAAAAACAAGGAGATCCCATCTAGATTTCTTCTTGCTTTTGACTCACAGCTGGAAGTTAGAAAAGCCT 
M CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 

£ GCT GAGTT AT TT AT GAAT TAGAGGCT AT AGCT' CAC ATTTTCAAT CCTCTATT TCTTTTTTTAAATATAACT 

as a 

Usi TT CT AC TC TGAT GAGAGAAT GTGGTTTTAATC! T C TCTCTC AC AT T T TG AT GATT T AGACAGACT C CCCCT C 

i I '! 

TT CCT CCT AGTC AAT AAACCCAT T GATGAT CT AT TTCCC AGCTT AT CCCC AAGAAAAC TT TT GAAAGGAAA 
! GAGT AGACCCAAAGATGTTATTTT CTGCTGTT 1 TGAATT TT GT CT CC CCACCCCC AACTTGGCTAGTAATAA 

AC ACT T ACTGAAGAAGAAGC AATAAG AGAAAG AT AT TT GT AATCTC TCC AGCCC AT GATCTCGGT TTTCT T 
ti\ ACACT GTGAT CT TAAAAGTT ACCAAACCAAAG T C ATTT TCAGT T TG AGGC AACCAAACCTTT CT ACTGCT G 

TT GAC ATCTT CT TATT AC AGCAACACC ATT CTAGGAGT TTCC T GAGCTCT CC ACTGGAGTCCTCTTTCTGT 
CGCGGGTCAGAAATTGTCCCTAGATGAATGAGAAAATTATTTTTTTTAATTTAAGTCCTAAATATAGTTAA 
AAT AAATAATGT TT T AGT AAAATGAT AC ACTATC TCTGTGAAAT AGCCTC AC CC CT ACATGT GGAT AGAAG 
GAAATGAAAAAATAATTGCTTTGACATTGTCTATATGGTACTTTGTAAAGTCATGCTTAAGTACAAATTCC 
ATGAAAAGCTCACACCTGTAATCCTAGCACTTTGGGAGGCTGAGGAGGAAGGATCACTTGAGCCCAGAAGT 
TCGAGACTAGCCTGGGCAACATGGAGAAGCCCTGTCTCTACAAAATACAGAGAGAAAAAATCAGCCAGTCA 
TGGTGGCATACACCTGTAGT CCCAGC AT TCCG GGAG GCTG AGGT GGGAGG AT CACTTG AGCCCAGGGAGGT 
TGGGGCTGCAGT GAGCCATGAT CACACCACTG CACTCCAGCC AGGTGAC ATAGCGAGATCCT GT CT AAAAA 
AATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAAACTAATTCTTTAA 
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FIGURE 118 



MSTTTCQWAFLLSILGLAGCIAATGMDMIaTSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFT ILGLPAMLQAVRALMI VGIVLGAI GLLVS I FALKCIRI GSMEDSAKANMTLTSGIMFIVSGL 
CAIAGVSVFANMLVTNFWMSTANMYTGMGGMVQTVQTRYT FGAAL FVGWVAGGLTL I GGVMMC IA 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 



Signal peptide: 
amino acids 1-23 



pp Transmembrane domains: 



amino acids 81-100, 121-141, 173-194 



UJ 

rji 

: 5 a 
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FIGURE HQ 



GGAAAAACTGTTCTCTTCTGTGGCACAGAG^yVCCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGAT AATG GCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAA.CATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTG^iAGGCTCACATTCTGCTGACGGCTGGA^TCATCTTCAT 
CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
% ACTCAATAGTGAATGTTGCCCAAAAACGT^GCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA 
m CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
01 CAGATAC TCGATACCTT CCC ATCGCACAACCCAAAAAAGT TATCACACC GGAAAGAAGTCACCGAGC G 

U| TCT ACTC CAGAAGTCAGTAT GTGTAGTT GTGTATGT TTT T T TAACTTTACTATAAAGCC AT GCAAAT G 

IM ACAAAAAT CTAT ATTACT T T CTCAAAAT GGACCCCAAAGAAACT T T GAT TTACT GTT CT TAACTGCCT 

m 

AAT C T TAAT TAC AGG AAC T GT GCAT CAG C TAT T TAT GAT T C TAT AAG C T AT T T CAGC AGAAT GAG AT A 
TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAA.GGTGGTTCA^GCATCTA 
f»j| CT CTT TT TATCAT TTACTT CAAAATGACATT GCTAAAGACT GCAT TATT TTACTACTGTAATTT CTCC 

fjl ACGACATAGCATTATGTACATAGATGAGTGTAACATTTATATCTCACATAGAGACATGCTTATATGGT 
CJ TTTATTTAAAATGAAATGCCAGTCCATTACi^CTGAATAAATAGAACTCAACTATTGCTTTTCAGGGAA 
UJ AT CATGGATAGGGTTGAAGAAGGTTAC TATT AATT GTTTAAAAAC AGCT TAGGGATTAATGTCCTCCA 

W TTTATAATGAAGATTAAAATGAAGGCTTT^TCAGCATTGTAAAGGAAATTGAATGGCTTTCTGATAT 
■^ GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 
TTCCAGGGCTATACTCAGAAGAAAGATAAAiiGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGTG 
AAAATAT TT TTGTTT TT GTAT TTGAAGAAGAATGAT GCAT T TTGACAAGAAATCATATATGTAT GGAT 
ATATTTTAATAAGTATTTGAGTACAGACTTTGAGGTTTCATCAATATAAATAAAAGAGCAGAAAAATA 
TGTCTTGGTTTTCATTTGCTTACCAAAAAA^^CAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTi^AATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TATACAT TTATATTAATAAATTGTACATT TTTCTAATT 
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FIGURE 120 

MATHALEIAGLFLGGVGMVGWAVTVMPQ 
IYDSLLALSPDLQAARGMCAASVMSFLAF^^ 
MWLIPVSOTANAIIRDFYNSIVWAQKRELGEALYLGW 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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FIGURE 121 

GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
C TCCGGCAGAGGGAGGT GGT GGACC TGTATAAT GGAATGTGCT TACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GATTCAAAGGAGAAAAGGGGGAATGTCTGAGGGAAAGCTTTGAGGAGTCCTGGACACCCAACTAC 
M AAGCAG TG TTCATGGAGTTCATTGAAT TATGGCATAGAT CTT GGGAAAAT TGCGG AG T GT ACAT T 

P TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
jgj GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
T\ ATT G AAG C T AT AAT T TAT T T G GAC CAAGG?\AGC C C T G AAAT GAAT T CAACAAT T AAT AT T CAT C G 

jj| CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
fl| TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC 
M AT TAT T GAAGAAC T ACCAAA ATAAA T GC T T TAAT T T T CAT T T GC TACC T C T T T T T T TAT TAT G CC 

* TTGGAAT GGT TCAC T TAAAT GACATT TTA*\ATAAGT T TATGTATACATC TGAAT GAAAAGCAAAG 

P 

,W C TAAAT AT G T T T AC AG AC CAAAG T G T GAT T T C AC AC T G T T T T T AAAT C T AGC AT TAT T CAT T T T G 

y ^ 

r 4 1 CTT CAAT CAAAAGTGG T T TCAATAT T TT TT TTAGT TGGT TAGAATACTT TC TTCATAGTCACAT T 

Ill CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 

r ; \ 

AAAAAAT AT AAAAGCTACCAAT CT T TGTACAAT TTG TAAAT G T TAAGAAT T TT T T T TATATCTGT 
N TAAAT AAAAAT TAT T TCCAACA 
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FIGURE 122 



MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGS PGANVI PGT PG I PGRDGFKGEKGECLRES FEES WTPNYKQCS WS SLNYGI DLGKI AECT FTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
S VE GLCE G I GAGLVDVAI WVGT C5 D YPKGDAS TGWNS VS R 1 1 1 EE L PK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
AT T GCCTGGAAGAATACATCAT GT T T T TCGATAAGAAGAAAT T GTAGGATCCAGTT T T T TT TT TA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCC CC TAAGCCAAAGCAAAAGACC TAAGGACGACC T T TGAACAATACAAAGGATGGG T T T CAATG 
TAAT TAGGCTAC TGAGCGGATCAGCTGTAGCAC TGGT TATAGCCCCCACTGTCT TACTGACAATG 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
L- . ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
^ ATAACAGCCTTCAAAAAC TTAAGTAT AATCAAT TTAAAGGGCT CAACCAGCTCACC TGGCTATAC 

Q C TT GACCATAACCATAT CAGCAATATTGACGAAAAT GCT TT TAATGGAATAC GCAGACTCAAAGA 

C| GCT GAT T CT TAGT TCCAATAGAAT CT CC TAT T T T C T TAACAATACC TTCAGACG TGTGACAAAT T 

rp TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
l'^ CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
Ht AGACTGCCGCAACCTGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATG 
TCT T T GC TGGCAT GAT CAGAC TCAAAGAAC T T CACC T GGAGCACAATCAAT T T T CC AAGC TCAAC 
01 CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
C AT AGGACAGACCATGT CCT G GACC T G GAGC T CC T TACAAAG GC T T GAT T TAT C AGGC AATGAGA 
TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
^ T C C AACAAG C T C AC AT T TAT TGGT C AAGAGAT T T T GGAT T C T T G GAT AT C C C T C AAT GAC ATC AG 

** TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
tfl TTAAAGGTCTAAGGGAGAATACAAT TATC TGTGCCAGTCCCAAAGAGCTGCAAGGAGTAAATGTG 

P AT C GAT GCAG T GAAGAACTACAGCATC TG TGGCAAAAG T ACTACAGAGAGGT T T GAT CT GGCCAG 

i7i GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
%l TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
fl.l CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAGT CCC TAAAGCAAATGACT CC CAGCACCCAGGAAT TT TATGTA 
GATTATAAACCCACCAACACGGAGACCAGCGAGATGCTGCTGAATGGGACGGGACCCTGCACCTA 
TAACAAATCGGGCTCCAGGGAGTGTGAGGTATGAACCATTGT GATAAAAAGAGC TCT T AAAAGC T 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GG T CAT T T T CCTCTCATACATAATCAACCCAT TGAAAT T TAAATACCACAATCAAT GTGAAGC TT 
GAACTCCGGTTTAATATAATACCTATTGTATAAGACCCTTTACTGATTCCATTAATGTCGCATTT 
G T T T T AAG AT AAAAC T T C T T T CAT AGG T AJ^AAAAAAAAA 
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FIGURE 124 

MGFNVIRLLS GSAVALVI APTVLLTMLS SAERGCPKGCRCEGKMVYCESQKLQE I PS S I SAGCLG 
LSLRYNSLQKLKYNQFKGLNQLT WLYLDHNHI SNI DENAFNG IRRLKEL I LS SNRI S YFLNNTFR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
LARWFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKSFKGLRENTIICASPKELQGVNVIDA^raTYSICGKSTTERFDLARALPKPTFKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHI SFHKI IAGSVALFLSVLVI LLVI YVSWKRYPASMKQLQQR 
S LMRRHRKKKRQSLKQMTPS TQE FYVD YKPTNTE T SEMLLNGT GPCT YNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 4 65-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 12K 



CCGTT AT CGTCT TGCGCTACTGCT GAATGTCCG TCCCGGAGGAGGAGGAGAGGCTT T TGC CGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
AT T GAAGAGGAAGGCTT T C T AAAGC T TT GGCAAGGAGTGACACCCGCCAT T TACAGACACGTAGT 
GTATTCTGGAGGTCGAATGGTCACATATG^CATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
i . T TAG C CAAT C CAAC T G AC C TAG T GAAGG T TC AG AT G C AAAT G GAAG GAAAAAGG AAAC T GGAAGG 

Q AAAACCAT TGCGATTTCGTGGTGTACATCATGCAT TTGCAAAAATC T TAGCTGAAGGAGGAATAC 

m 



GAGGGCT T T GGGCAGGCTGGGTACCCAATATACAAAGAGCAGCACTGGTGAATATGGGAGATT TA 
AC C AC T TAT G ATAC AG T G AAACAG T AC T T GG TAT T G AAT AC AC CAC T TGAGGACAAT AT CAT G AC 
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
rp TCAAAAGCAGAATAATGAATCAACCACGAGATAAACAAGGAAGGGGACT TT TG TAT AAAT CATCG 

j*?* AC T GAC T GC T T GAT T CAGG C T GT T CAAGG T GAAGGAT T CAT GAGTC TATATAAAGGC T T T T T AC C 

jS ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 

**$ TGAGTGGAGTCAGTCCATTTTAA 
! 11 

i»5 

UJ 

a 

m - 
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FIGURE 126 



MSVPEEEERLLPLTQRWPRASKFLLSGCAMVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHWYSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VIGGMMAGVIGQFLANPTDLVKVQMQMEGKRK^ 

NIQRAALVNMGDLTTYDTWHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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FIGURE 127 



CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGftATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCGTACAGATGTGACCTAT 
CAAAT GAAGAGGAC ATCC T C TCCAT GT T C T C AGCT AT CC G T T CT CAGCACAGCGG T G T AGACAT C 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
fa GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
^ AGGAGCGGAATGTGGACGATGGGCACATCATTAACATCAATAGCATGTCTGGCCACCGAGTGTTA 
% CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
\jl GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCAC'GTGCATCTCTCCAGGTGTGGTGG 
r{ 1 AGACACAAT TCGCCT TCAAACT CCACGACAAGGACCC T GAGAAGGCAGCTGCCACCTATGAGCAA 

0.1 ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
^ CATCCAGAT TGGAGACATCCAGAT GAGGCCCACGGAGCAGGTGACC TAG TGAC TG TGGGAGCTCC 

r 1 

ill CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 



TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 



TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCCTGGCCCAGTGGAT TTCATGGTGATCAT TAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 



App_ID=10063581 



Page 279 



FIGURE 128 

MARPGMERWRDRLALVTGAS GG I GAAVAR^VQQGLKVVGCARTVGN IEE LAAE CKS AG YPGTL I 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGL^PDTLLSGSTSGWKDMFNVNVLALSICTR 
EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISPGWETQFAFKLHDKDPEKAAATYEQM3KCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 



Important features of the protein: 
Signal peptide: 

Islt amino acids 1-17 

: 

y N-myristoylation sites. 

Y;\ amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 

jlf 199-205 

S 

M Short-chain alcohol dehyrogenase. 

amino acids 30-42, 104-114 

CI 

GI 
yj 

.ffi : 
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FIGURE 12Q 



AACT X C TACATGGGCCTCC'i'GCIGC TGGTGC TCTTCCT CAGCCTCC TGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACGTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAAATGAGGAACTGAGi^AAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
T TAAT T T TT AACTGATAG TT G TACAT AT T TGGGGGTACATGTGATAT T TGGATACAT GTATACAA 
TAT AT AATGAT CAAATCAGGG TAACT GGGATATCC ATCACATC AAACAT T TAT T TT T TAT TCT T T 
TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
G C AC C AC AAT GC C C AAC TAAT T T T T G TAT T T T TAG TAG AG AC GGGGTTTTGC CAT G T T G C C C AGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 

ill 

%l GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 

m 

U< r ACAATGAATtATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 

$ ATCTAACTGTATATTTGTACCAGTTAACCi^ACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 

Q AACCT C T GAT CACC TCAT TC TACT CTCTACC TCCAT GAGAT CCAC T T T T T TAGC TCCCAC ATG T G 

%l AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 

T\ TTCCATCCAT GT T GCTGCAAATGACAGGATT TCGT TCT TAAT T TCAAT TAAAAT AACC ACACATG 

% GCAAAAA 

m 
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FIGURE ISO 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRS FCVCAVHFS SHQPPVAVECLK 



Important features of the protein: 
Signal peptide: 

amino acids 1-18 



N-myristoylation site. 



Isj amino acids 86-92 



i'.j Zinc carboxypeptidases, zinc-binding region 2 signature. 

Ill amino acids 68-79 
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FIGURE 131 



TTCTGAAGTAACGGAAGCTACCTTGTATAAAGACCTCAACACTGCTGACCATGATCAGCGCAGCCTGGAGC 
AT CTTC CTCATCGGGACT AAAATT GGGCT GTT CC TTCAAGTAGC ACCTCT AT CAGTTATGGCTAAAT CCTG 
TCCAT C TGTGTGTC GCTGCGAT GC GGGTTTCAT T T ACTGT AATGATCGCTTT CT GAC ATCCATT C C AACAG 
GAATACCAGAGGATGCTACAACTCTCTACCTTCAGAACAACCAAATAAATAATGCTGGGATTCCTTCAGAT 
TTGAAAAACTTGCTGAAAGTAGAAAGAATATACCTATACCACAACAGTTTAGATGAATTTCCTACCAACCT 
CCCAAAGT ATGT AAAAGAGT T ACAT T TGCAAG AAAAT AAC AT AAGG ACT ATCACTT AT GATT CACTTTCAA 
AAATT C CCT ATCTGGAAG AATT AC AT T TAG AT GAC AACTCT GTCTC TGCAGT T AGC AT AGAAGAGGGAGC A 
TTCCGAGACAGC AACT ATCT CCGACT G CTT TT CCTGTCCCGT AATC ACCT TAGC AC AATTCCCT GGGGTTT 
h*% GCCC AGGACT AT AGAAGAAC T ACGC T TGGATG AT AATC GCAT AT CC ACTATT TCATC ACCAT CT CTT C AAG 

QJ GT CTCACT AGTCT AAAACGCCT GGTT CT AG AT GGAAACCTGTTGAAC AATCATGGT TT AGGT GACAAAGTT 

TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 
TCCAGGCACAAACCTGAGGAAGCTTTATCTTCAAGATAACCACATCAATCGGGTGCCCCCAAATGCTTTTT 
CTTATCTAAGGCAGCTCTATCGACTGGATATGTCCAATAATAACCTAAGTAATTTACCTCAGGGTATCTTT 



m 
ui 



m 

at* GAT GAT TT GGAC AAT AT AAC AC AACT GAT T CT T C GC AACAAT CC CT GGT ATT GC GGGT GCAAGAT G AAAT G 

yj 

y ; GGTACGTGACTGGTTACAATCACTACCTGTGAAGGTCAACGTGCGTGGGCTCATGTGCCAAGCCCCAGAAA 
\, AGGT T C GT GGGAT G GCTATT AAGGAT CT C AAT GCAGAACTGTTT GATTGT AAGGAC AGTGGGAT TGT AAGC 

f* 1 AC CAT T CAGAT AAC CACTGC AATACCCAAC ACAGTGTATCCTGCCCAAGG ACAGTGGCCAGCTCCAGTGAC 

|jl CAAACAGCCAGATATT AAGAACCC C AAGCT CA CT AAGG ATCAAC AAACC ACAGGGAGTCCCT CAAG AAAAA 

CAATTACAATTACTGTGAAGTCTGTCACCTCTGATACCATTCATATCTCTTGGAAACTTGCTCTACCTATG 
h ACTGCTTTGAGACTCAGCTGGCTTAAACTGGGCCATAGCCCGGCATTTGGATCTATAACAGAAACAATTGT 
M AACAGGGGAACGCAGTGAGTACTTGGTCACAGCCCTGGAGCCTGATTCACCCTATAAAGTATGCATGGTTC 
CC AT GGAAACCAGCAACCT CT ACCT ATT T GAT GAAACTCCTGTT TGTATT GAGACTGAAACT GC ACCCCT T 
CGAATGTACAACCCT ACAACCACCCTC AAT CG AG AGC AAG AGAAAG AACCTT ACAAAAACCC CAATTT ACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TTCATAGGAATGGATCGCTCTTCTCAAGGAACTGTGCATATAGCAAAGGGAGGAGAAGAAAGGATGACTAT 
GCAGAAGCTGGC ACT AAG AAGGAC AACTCT AT C CTGGAAATC AGGG AAACTT CT TTTC AG AT GTT ACC AAT 
AAGCAATGAACCCAT CTCGAAGGAGG AGTT TGTAAT AC ACACCAT ATTTCCT CCT AATGGAATGAATCTGT 
ACAAAAACAATCACAGTGAAAGCAGTAGTAACCGAAGCTACAGAGACAGTGGTATTCCAGACTCAGATCAC 
TCACACTC ATGAT GCTGAAGGACTCACAGCAGACTTGTGTTTTGGGTTTTTTAAACCTAAGGGAGGTGATG 
GT 



ru 
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FIGURE 15*2 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNSVSAVS I EEGAFRDSNYLRLL FL SRNHLS T I PWGLPRT I EELRLDDNRI STISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRKSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRL'DMSNNNLSN^ 

KVNVRGLMCQAPE KVRGMAI KDLNAE L FDCKDSGIVS T I Q I TT AI PNT VYPAQGQWPAPVTKQPD 
IKNPKLTKDQQTTGSPSRKTITITVKSVTSDTIHISWKLALPMTALRLSWLKLGHSPAFGSITET 
I VT GERSE YLVTALE PDS PYKVCMVPME TSNL YL FDE TPVC I E TE TAPLRMYNPTTTLNREQEKE 
PYKNPNLPLAAI IGGAVALVT IALLALVC WYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 
ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 
amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site, 
amino acids 34-44 
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FIGURE 133 

CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTG6GAGGAGGGCG 
TGCAGAGGCAGTCTGGGC TT GGCCAGAGCTCAGGG TGCT GAGCGTG TGACCAGCAGT GAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTiGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
C TC CACT ACCTCAAGCTGAGTGAT CC CAAGTACCTAAGAGAG T TCCAGC T GACC CT CCAGCCCGG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GAC CACT CCAACAGAGCCAGGACT AT ATCAACC TC TTCTGCGCCAACATGATGGACT TGAACCGC 
AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAA.GCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGCAGGAAGGATGCTTCGGGGAGCCTGATGCTGAAGATGAAGAATTATCTAAAGCTATTCAA 
TAT CAGCAGCAT T T T TCGAGGAGAG T GAAGAGGCGAGAAAAACAAT T TCCAGAT TCTCGCTCTGT 
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCT GAAAAGCAC T T TCAAAGAGACTGTGT TGAATAAAGGGCCAAGG T TCT TGCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCCGC TGAGACGGACGGT TCCATGCCAGCT GCC TGGAGGAGGAACAGACCCC 
T T T AGTC C T CATCCC TTAGATC CT GGAGGGCAC GGATCACAT C C T GGGAAGAAGGCAT CT GGAGG 
ATAAGCAAAGCCACCCCGACAC CCAATC T TGGAAGCCC T GAGT AGGCAGGGCCAGGGT AGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 



App_ID=10063581 



Page 285 of 320 



FIGURE 134 

MSARGRWEGGGRRACRGS LGLARAQGAERVTS SEQRPAMAS LGLLLLLLLTALPPLWSS SLPGLD 
TAESKAT IADL ILSALERATVFLEQRLPE INLDGMVGVRVLEEQLKSYREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
Y I NL FCANMMDLNRRAEAI GYAYPTRD I MEN IMFCGMGG FS D FYKLRWLEAI LS WQKQQEGC FG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
S S W D YRS VPPYLANF Y I FLVETGFHHVAHAGLELLI SRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site, 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1. 

amino acids 230-237 
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FIGURE 135 

GGTCT GAG X GCAGAGCTGCTG T CATGGCGGCCGCTCT GT GGGGCT TCTT TCCCGTCCT GCTGCT G 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCA?IGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGC T T ACAGAT T TGATCCCGT T CGAGTGGATATCACT TCGAAAGGAAAAATGAGAGCAAGATAT G 
TGAATTACATCAAAACATCAGAGGTTGTCAGACTGCCCTATCCTCTCCAAATGAAATCTTCAGGT 
CCACC TT CT TAC T T TAT TAAAAGGGAATCGTGGGGC TGGACAGAC T T TC TAAT GAACC CAATGGT 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
CTGACATGAGACGGGAAATGGAGCAGTCAiHTGAATATGCTGAATTCCAACCATGAGTTGCCTGAT 
GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 
TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAG GTAG TCAGGCCGTCCAGAGCTGGCATTTGCAC 
AAACACGGCAACAC TGGGTGGCATCCAAG TC TTGGAAAACC GT GTGAAGCAAC TAC TATAAAC T T 
GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
GAG G T CAAT AT T GAT G T CACT GAAT TAAT TACAGT G T CC T AT AGAAAAT G CCAT T AAT AAAT TAT 
ATGAACTACTATACATTATGTATATTAATTAAAACATCTTAATCCA.GAAATCAAAAAAAAAAVlA 
AAAAAAAAAAAAAAA 
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FIGURE 1*6 

MAAALWGFFPVLLLLLLSGDVQSSEVPGAJ^GSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGS FWHD I PS GS YWEWS PAYRFDPVRVDI TS KGKMRARYVNY IKT SE 
VVRLPYPLQMKSSGPPSYFIKRESWGWTDFLMNPMVMMMVXPLLIFVLLPKVVNTSDPD 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 



Important features of the protein: 
Signal sequence: 

b4 amino acids 1-23 

Transmembrane domain: 

fl amino acids 161-182 

UJ 

m 

f|j N-glycosylation site. 

M amino acids 184-187 



tt\ Glycosaminoglycan attachment sites. 

fii amino acids 37-40, 236-239 

las* 

ui 

Q cAMP- and c(&lP-dependent protein kinase phosphorylation site. 

rS f 

M' f amino acids 151-154 



N-myristoylation sites, 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 

GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTAT ATGC GTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCA13ACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
. , AAGC TGT GT GATCGCCACAAACCT T C AGGAAATACGAAAT GGAT T T TCT GAGATAC GGGGCAGTG 

g] TGCAAGCCAAAGATGGAAACAT TGACATCAGAAT C T TAAGGAGGAC TGAG TCT T TGCAAGACACA 

Q AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
ft*. AAAC T AC C AG AC C C C T GAG CAT TAT AC T C T C C G GAAGAT C AG C AG C C T C G C C AAT TCCTTTCTTA 

i - i 

ffj- CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 

%\ AAGAAAT ACAGCCAGAT TCT GAG T CAC T T TGAAAAGCT G GAACC T CAG GCAG CAGT TGTGAAG GC 

U TTTGGGGGAACTAGACATTCTTCTGCAATGGATGGAGGAGACAGAATAGGAGGAAAGTGATGCTG 

3 CTGCTAAGAATATTCGAGGTCAAGAGCTCCAGTCTTCAATACCTGCAGAGGAGGCATGACCCCAA 

?*\ 

ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 

i?l 

CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 
f' ! | TGTAATATCTT TCTGCTATTGGATATAT TTATTAGTTAATATATT TATT TAT TTTTTGCTATT TA 

I;!. ATGTAT T TAT T TT T T TAC T TGGACAT GAAAC TTTAAAAAAAT T CACAGAT TATATT TATAACC TG 

fll ACTAGAGCAGGTGATG TAT TT T TATACAG TAAAAAAAAAAAACCT TGTAAAT TCTAGAAGAGTGG 

CTAGGGGGGTTATTCATTTGTATTCAACTi\AGGACATATTTACTCATGCTGATGCTCTGTGAGAT 
ATTTGAAATTGAACCAATGACTACTTAGGilTGGGTTGTGGAATAAGTTTTGATGTGGAATTGCAC 
ATCTACCTTACAAT TACTGACCATCCCCAGTAGAC 1CCCCAG T CCCATAATTG T GT AT C T TCCAG 
C CAGGAATC C TACACGGCC AGC AT G TAT T T C T AC AAATAAAG T T T T C T T T GC AT AC CAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 138 

MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 

SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 

FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 

SLANSFLTIKKDLRLSHAHMTCH^ 

E 



lis;; 



Important features of the protein: 
Signal peptide: 
amino acids 1-42 



CP cAMP- and cGMP-dependent protein kinase phosphorylation sites. 



amino acids 192-195, 225-228 



m 

m 

^ N-rnyristoylation sites. 

12 amino acids 42-47, 46-51, 136-141 

w 

11? - 

a ' 
hi 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GG T CAAAGGG T G T GAAAT TT AT GCCC TACACAACT TACC T T G T GGAAAAAGGAGCATCTCACAGT 
TT T G TGGC TGAAGCAAAGCCAC CT ACAGT TACTATGC CT CGAATCAAGGCATT GAT GACGGGGAG 
CCT T CCTGGCT T T G TCGACGTCAT CAGGAA.C C T CAAT TC TCCT GCAC TGC TGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGCTGGAAAAAGAATAG TC T T T T AT GGAGAT GAAACC T GGGT TAAAT TA 
TTC CCAAAGCATT T TGTGGAAT AT GATGGAACAACCT CA.T T T T TCG TGTCA.GATTACACAGAGGT 
GGATAATAATGTCACGAGGC AT T T GGATA&AGTATTAAAAAGAG GAGAT TGGGACATAT TAATCC 
H* TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
p AAGCTGAGCGAGATGGACAGCGTGCTGATGAAGATCCACACCTCACTGCAGTCGAAGGAGAGAGA 
g GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
SJ GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
^; CCCGGTGATATCCGACATCCAAAGCACGTCCAATAGACGGATGTGGCTGCGACACTGGCGATAGC 
UJ ACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC 
LSI CAAT GAGAGAGCAG T T GAGAT T T T T ACAT T T GAATACAGT GCAG CT T AGT AAAC T GTT GC AAG AG 

f|1 AAT G T GC CG T CAT AT GAAAAAGAT CCTGGGTTT GAG C AG T T T AAAAT G T C AG AAAG AT T G CATGG 

GAACTGGATCAGACTGTACTTGGAGGAAAAGCATTCAGAAGTCCTATTCAACCTGGGCTCCAAGG 
r ~ TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
P TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
Q CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
; |-| CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
%l GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
f\ AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
UJ TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
CJ C^GGTGATTGTAAAGAGCTGGCGGTCAC?\GAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
rjj TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 
CACTCAGGAGGG TCAAAAGGAGAC T T GGT CGCACCAC TCATCCTGCCACC CCCAGAAT GCATCC T 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACT CAGATCCACAGAGCCCAGGAT CAAGGGACCCACT GCAGTGGCAGCAGGACT GT TGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CTGACCGAGACACTCACAGCTTTGTCATCAGGGCACAGGCTTCCTCGGAGCCAGGATGATCTGTG 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
CTGCACACAGTATGTAGTTACCAAAAGAATAAACGGCAATAATTGAGAAAAAAAA 
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FIGURE 1AO 

MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKVVIVLI DALRDDFVFGSKGVKEMPYTTYLVEKGASHS FVAEAKPPTVTMPRIKALMTGSLPGF 
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFTCYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGASSTEEVNTPLILISSAFERKPGDIRHPKHVQ 



Important features of the protein: 
Signal peptide: 

amino acids 1-34 



Transmembrane domain: 

amino acids 58-7 6 

is? f 

m. 

■ N-glycosylation sites. 

amino acids 56-60, 194-198 

N-myristoylation sites. 



»i ■ amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 



275-281, 278-284 

Amidation site. 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 



GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TCCCTATAGAAAACAACTGCCAGCACCT TAAGACCACTCACACCT TCAGAGTGAAGAACTTAAAC 
CCGAAGAAATTCAGCATTCATGACCAGGATCACAAAGTACTGGTCCTGGACTCTGGGAATCTCAT 
AGCAGT TCCAGATAAAAACTACATACGCCCAGAGATCTTCTTT GCAT TAGCCTCAT CCTT GAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGTGACAAGGATAAAGGACAAAGTCATCCATCCCTTCAGCTGAAGAAGGAGAAACTGATGAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GT T GGGGT GACAGATAAATT TGAGAACAGGAAACACAT T GAAT TT TCAT TTCAACCAGTT T GCAA 




AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 



AT T TGAACTAAT TGTATAAAAACACCAAACC TGCTCACT 
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FIGURE 142 

MLLLLIiE YNFP IENNCQHLKT THT FRVKNLNPKKFS IHDQDHKVLVLDS GNL IAVPDKNYIRPE I 
FFALAS SLSSASAEKGS PI LLGVSKGE FCLYCDKDKGQSHPS LQLKKEKLMKLAAQKESARRP F I 
FYRAQVGSWNMLESAAHPGWFI CTSCNCNEPVGVTDKFENRKHIEFS FQPVCKAEMS PSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE 143 



CTAGAGAGTATAGGGCAGAAGGAT GGCAGAT GAGTGAC T CCACATCCAGAGC TGCCTCCCT TTAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
C T GGCAT T T CCACGAGCAAAGGGAC TGTGATGAACACAATGT CATGGC T C GT TACC T CCCTGCCA 
CAGTGGAGTTTGCTGTCCACACATTCAACCAACAGAGCAAGGACTACTATGCCTACAGACTGGGG 
CACATCTTGAATTCCTGGAAGGAGCAGGTGGAGTCCAAGACTGTATTCTCAATGGAGCTACTGCT 
U s GGGGAGAACTAGGTGTGGGAAAT T TGAAGAC GACAT TGACAAC TGCCAT T TCCAAGAAAGCACAG 

p AGC T GAACAATAC T T TCACCTGCTTCTTCACCATCAGCACC AGGCC C TGGATGACT CAGT T CAGC 

Q C TC CT GAACAAGACCTGC T T GGAGG GAT TCCAC TGAGT GAAAC CCAC TCACAGGCT TGTC CAT GT 

~ \ GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
X%\ TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
gj ACACATC T T GAGCC TAATCATGTAGT GTAGATCAT TAAACAT CAGCAT T TTAAGAAAAAAAAAAA 

H< AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA?V^^ 



ax i 

'■•■-•A 



r 5 * 
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FIGURE 144 

MLGLPWKGGLSWALLLLLLGSQI LLI YAWHFHEQRDCDEHNVMARYLPATVEFAVHT FNQQSKDY 
YAYRLGH ILNSWKEQVESKTVFSMELLLGRTRCGKFEDDI DNCH FQES TELNNTFTC FFT I S TRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 
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FIGURE 145 



CTGTGCAGCTCGAGGCTCCAGAGGCACACTCCAGAGAGAGCCAAGGTTCTGACGCG ATGA GGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GAC T T CGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTAC T GGCAGTTCCCC GAT GGC ATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 
CTTCTCCCCCAAA.CCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
GAGAAGAACACAT CAGGCAC T GC GCCACC TGC TTCACAGTAC T TCCCAAC AACT CT T AGAG GT AG 
GTGTATTCCCGTTTTACAGATAAGGAAACTGAGGCCCAGAGAGCTGAAGTACTGCACCCAGCATC 
ACC AGCTAGAAAGT GGCAGAGCCAGGAT T CAACCCT GGCT T GT C TAACCC CAGG T TTTCTGCTCT 
GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
CTCTAATGAAAT TGTGAAAGCTCCATGT TTAGAAATAAATGAAAACACCTGA 
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FIGURE 146 



MRKHLSWWWLATVCMLL FSHLSAVQTRGIKHRIKWNRKALPS TAQI TEAQVAENRPGAFI KQGRK 
LD I D FGAE GNRY YEANYWQ FPDG I HYNGC S E ANVTKEAFVT GC I NAT QAANQGE FQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites . 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 76-83 

N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 

GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACT(2ATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAAC T T T CAGAT TCAGGGGGTACATGT GAAG GT T TG TT T TATGAGTATATTGC ATGA TGCTGAGG 
TTTGGGGT 



ill 

m 



UJ 
fiJ 
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FIGURE 148 



MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 



Important features of the protein: 
Signal peptide: 
amino acids 1-25 



N-myristoylation site. 

^ amino acids 62-68 

a 

a 

W 

m 
m 



a 
m 
ci 

UJ 

a 
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FIGURE ldQ 



G T C T C C GCGTC ACAG GAACT T CAGCAC CCACAGGG C GGACAGCGC T CCC CTC TACCTGGAGACT TGAC 
TCCCGCGCGCCCGAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTrTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAAT CTCTACATAATAGAGAATTAAGTGCAGAAAGACCTT T GAATGAACAGAT TGCT GAA 
G CAGAAGAAGAC AAGAT TAAAAAAACATATCCT CCAGAAAACAAGCCAGGTCAGAGCAACT ATTCTTT 
T GT T GATAACTT GAACCTGCTAAAGGCAATi^ACAGAAAAGGAAAAAATT GAGAAAGAAAGACAATCTA 
TAAGAAGCTCCCCACTT GATAATAAGT TGAATGTGGAAGAT GT TGATTCAACCAAGAAT CGAAAACTG 
M : ATCGATGATTAT GACTCTACTAAGAGT GGATTGGATCATAAATTTCAAGATGATCCAGATGGTCTTCA 

TCAACTAGACGGGACTCCTTTAAGCGCTGAi^GACATTGTCCATAAAATCGCTGCCAGGATTTATGAAG 
5: AAAATGACAGAGCCGTGTTTGACAAGATTGTTTCTAAA.CTACTTAATCTCGGCCTTATCACAGAAAGC 
> i CAAGCACATACACTGGAAGAT GAAGTAGCAGAGGT T TTACAAAAAT TAATCT CAAAGGAAGCCAACAA 

iff TTAT GAGGAGGATCCC AATAAGCCCACAAGCTGGACTGAGAATCAGGCT GGAAAAAT AC CAGAGAAAG 

y) TGACTCCAATGGCAGCAATTCAAGATGGTCTTGCTAAGGGAGAAAACGATGAAACAGTATCTAACACA 
M T TAACCTTGACAAATGGCTTGGAAAGGAGArVCTAAAACCTACAGT GAAGACAACTTTGAGGAACTCCA 

- ATATTTC C CAAAT TTCTAT GCGCTACT GAA^GTAT TGAT TCAGAAAAAGAAGCAAAAGAGAAAGAAA 

CACTGAT T ACTAT CATGAAAACACTGATT GACT TTGTGAAGATGAT GGT GAAATATGGAACAATATCT 
%l CCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCT 
|j| AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 
Q CAGACAGTACCAAGGAAGAAGCAGCTAAGATGGAAAAGGAATATGGAAGCTTGAAGGAT TCCACAAAA 

f|| GATGATAACTCCAACGCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGC 

CATCAGAAAAAATAT TGAATGGTTGAAGAA?VCATGACAAAAAGGGAAAT AAAGAAGAT T ATGACCTTT 
CAAAGAT GAGAGACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCAT CCT T GACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATTTATAGC^GCCTGTAAAAATGGCAAAAGATCCAGGAGTCTTTCAA 
CTGTTTCAGAAAACATAATATAGCTTAAAACACTTCTAATTCTGTGATTAAAATTTTTTGACCCAAGG 
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC 
GTAAAAACTATCT GAAAGTAAAGTTGT ATGT AAGC T GAAAAAAAAAAAAAA&AAAAA 
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FIGURE lso 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYS FVDNLNLLKAI TEKEK IEKERQS IRSS PLDNKLNVE DVDS TKNRKL I DDYDS TKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEYAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKT YSEDNFEELQYFPNFYALLKS I DSEKEAKEKETLI T IMKTLIDFVKMMVKYGT I SPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE isi 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GMGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACCAGCTTCTAGCTGGAGGGCTGCATGCAGGGAAGGTCATTAAAGGTGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
H GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 



GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
Llj G AAAT G G C T GGAGC T C AG AAG A T AAAAGAT AAG TAG G G T AT G C T GAT C C T C T T T T AAAAAC C CAA 
jj| GATACAATCAAAATCCCAGATGCTGGTCTCTATTCCCATGAAAAAGTGCTCATGACATATTGAGA 
%*. AGACC T AC T T AC AAAG T GG C AT AT AT T GC AAT T TAT T T T AAT T AAAAG ATACC TAT T TAT AT AT T 



TCTTTATAGAAAAAAGTCTGGAAGAGTTTACTTCAATTGTAGCAATGTCAGGGTGGTGGCAGTAT 
AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
? AT T CC T T G T AT AAAAAT AAGAAAAGAAAT T AAT C T T GAG G T AAG C AGAGC AGAC AT CAT C T C T GA 

lJ TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
in' TTGTAGTAGTGATCAGGAAACAGATCTCAGCAAAGCCACTGAGGAGGAGGCTGTGCTGAGT T TGT 

% GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
GAT CACAGCCCC TGGGATTCCAAGGCAT T GGAT CCAGT CTCTAAGAAGGC TGC T GT AC TGG T TGA 
yj ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
Q GTCTCTGCAGATGTAGT TAGTTAAGACAAGGTCATGCTGGATGAAGGTAGACCTAAAT TCAATAT 

f%\ GAC TGG T T TCCT TGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGACT ATGTA 

AAGAT GAAGGCAGAGATCGGAGTT T T GCAGCCACAAGC TAAGAAACACCAAGGAT TGT GGCAACC 
ATCAGAAGCTTGGAAGAGGCAAAGAAGAATTCTTCCCTAGAGGCTTTAGAGGGATAACGGCTCTG 
C TGAAACC TTAATC TCAGACT T CCAGCC TCC TGAACGAAGAAAGAATAAAT TTCGGC T GT T T TAA 
GCCACCAAGGATAAT TGGT TACAGCAGC TCTAGGAAACTAATACAGCTGCTAAAAT GATCCCTGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAGTGATGGCATGCCACTTCCAAGATTAGGTTATAAAAGACACTGCAGCTTC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCT AGCTGCCATGCTATGAGCAGGCCTATAAAGAGAC T TACG TGGTAAAAAATGAAGT C TCCT 
GCCCACAGCCACATTAGTGAACCTAGAAG'CAGAGACT CT GTGAGATAATCGATGTTTGTT GTTTT 
AAGT T GCT CAGTTT TGGTC T AACTTGT TATGCAGCAATAGATAAATAATATGCAGAGAAAGAG 
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FIGURE 152 

MVLSGALCFRMKDSALKVLYLHNNQLLAGGLHAGKVIKGSE I S WPNRWLDASLS PVILGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKS FT FYRRDMGLTS S FESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 



N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 



Interleukin-1 signature* 

K amino acids 111-131 



f\% Interleukin-1 proteins. 

yj amino acids 8-29, 83-120, 95-134, 64-103 

y'| 



r) 
Li J 

111 
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FIGURE 15 3 



C I T CAGAACAGGT TCTCCTICCCCAGTCACCAGT TGCl'C GAG T TAGAAl' T GTCT GCAATGGCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTT GGTACAGGGAGGAGCAGCT GCGCCCATCAGCTCCCACTGCAGGCT TGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGA^CTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TAT T GAAGGTGATGACCTGC AT AT CCAGAGGAATG T GCAAAAGC TGAAGGACACAGT GAAAAAGC 
TTGGAGAGAGTGGAGAGATCAAAGCAATTGGAGAACTGGATTTGCTGTTTATGTCTCTGAGAAAT 
£J GCCT GCATT TGACCAGAGCAAAGC TGAAAkAT GAATAACTAAC CCCCT T TCGCT GC TAGAAATAA 

JS* CAAT TAGAT GCCCCAAAGCGAT t T T T T T TAACCAAAAGGAAGATGGGAAGCC AAACTCCATCATG 

rfi ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
UJ AGACCAGAAG GTAGAC T T TC TAAGCATAGATAT T TATTGAT AACAT T T CAT TGT AACT G GTGT T C 

W'f T AT AC AC AGAAAAC AAT T TAT T T T T TAAAT AAT TGTCTTTTTC CAT AAAAAAGAT T AC T T T CC AT 

f\ TCCT TTAGGGGAAAAAACCC CTAAATAGC T TCATG T TTCCATAAT CAG TACT T T AT AT TTATAAA 

T G TAT T TAT TAT T AT TAT AAGAC T GCAT T T TAT T TAT AT CAT T T TAT T AAT ATGGAT T TAT T TAT 

r 

gj AGAAACATCATTCGATATTGCTACT TGAGTGTAAGGCTAATAT TGATATT TATGACAATAATTAT 

Lfl AGAGCTAT AACAT GTT TATT TGACC T CAATAAACAC TTGGATATCC C 

yj 
ill 
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FIGURE 1*4 

MAALQKSVSS FLMGTLATSCLLLLALLVQGGAAAPI SSHCRLDKSNFQQPYI TNRT FMLAKEASL 
ADNNTDVRLIGEKLFHGVSMSERCYIMKQ^/LNFTLEEVLFPQSDRFQPYMQEVVPFLARLSNRLS 
TCH I E GDDLH I QRNVQKLKDT VKKLGE S GE I KAI GEL DL L EMS LRNAC I 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 



f" N-glycosylation sites. 

2{ amino acids 54-58, 68-72, 97-101 

iji N-myristoylation sites. 

tfl amino acids 14-20, 82-88 



Prokaryotic membrane lipoprotein lipid attachment site. 



'f»t amino acids 10-21 

fsaf 
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FIGURE 155 



GGCTTGCTGAAAATAAAATCAGGACTCCTMCCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGG 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGAT TAGGT GAGGACAGT TCTCTCAT TAGCCTTTTCCTACAGGTGGT TGCAT 
T C T T G GCAAT G G T CAT GG G AAC CC AC ACC TACAGC CAC T GGC C CAGC TGC TGC C CC AG CAAAG GG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACA(jACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 
CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCC CAGCACAGGCAC T T TCT AGAT AT T T CCCCCT T GCTGGAGAAGAAAGAGCCC CT GGTT TTAT T 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
T TGACAT GGATGAT TCT GAGGAGGAAGC TGT TAT TGAATG TATAGAGATT TATC CAAATAAATAT 
CTTTATTTAAAAATGAAAAA 
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FIGURE 156 



MRERPRLGEDSSLISLFLQWAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 



Important features of the protein: 
Signal peptide: 

amino acids 1-32 

ht'v N-glycosylation site. 

amino acids 136-140 

Si 

l-j Tyrosine kinase phosphorylation site. 

jj| amino acids 127-135 

r * N-myr i s toyl a ti on s i te s . 

amino acids 4 4-50, 150-156 

if! 
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FIGURE 157 

CCGGCG ATG TCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACTTGAGGGACCTCCGAGTAGAACCTGTTACAACTAGTGTTGCAACAGGGGACTATTCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GATTTGTGTGACGGGCAAAAGCAACTTCCAGTCCTACAGCTGTGTGAGGTGCAATTACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCTGAACACAGTCTATTTCATTGGGGCCCATAA.TATTCCTAATGCAAATATGAATGAAGATGG 
CCCTTCCATGTCTGTGAATTTCACCTCACCAGGCTGCCTAGACCACATAATGAAATATAAAAAAA 
AGT GTGT CAAGGCCGGAAGCCT GT GGGAT CC GAACAT CACT GC T TGTAAGAAGAAT GAGGAGACA 
Q GTAGAAGTGAA.CTTCACAACCACTCCCCTGGGAAACAGATACATGGGTGTTATCCAACACAGCAC 
fii TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA 
TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
■;V S AGC GAC T GCAT CCGACAT AAAG GAAC AG TTGTGCTCTG C CCAC AAACAGG CGTC CC T T TCCC TC T 

m GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
jwfc CAT GGGT GC T GG TGGC AG GGAT C T AT CTAA.T G T GGAGGCACGAAAGGATC AAGAAGAC T T C C T T T 

(S TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGA7VATATGTTTCCA 
'fjj TCACACAATTTGTTACTTCACT GAATTTCTTCAAAACCAT TGCAGAAGTGAGGT CATCCT TGAAA 

2| AGT GGCAGAAAAAGAAAATAGCAGAGAT G GGT C CAGT GCAGT GGCT TGCCACTCAAAAGAAGGCA 

jjj GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
CJ GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
Til GTGAT CT AAGAAGCCAGATT CATCTGCACAAATAC G T GGTGGT CTACT TTAGAGAGAT TGATACA 

AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE 158 



MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV 
LRADAS IRLLK&TKICVTGKSNFQSYSCVRCNYT EAFQTQTRPSGGKWTFS Y IGFPVELNTVYFIGAHNI P 
NANMNE DG PSMS VNFT S PGCLDHIMKYKKKCVKAGSLWDPN I TAC KKNEET VEVNFTTTPLGNRYMAL IQH 
ST I IGFSQVFEPHQKKQTRASVVIPVTGDSEGAT VQLTPY FPTCGSDCIRHKGT WLCPQTGVP FPLDNNK 
SKPGGWLPLLLLSLLVATOTLVAGIYLMWRHERIKKTSFSTTTLLPPIKVLVVYPSEICFHHTICYFTEFL 
QNHCRS EV I LE KWQ KKKI AEMGPVQWLATQKKAADKW FLLSNDVN S VCDGTCGKS EG S PSENS QDL FPLA 
FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 

'f*\ Important features of the protein: 

•22 f 

Signal peptide: 

Qjl amino acids 1-14 



Ufi Transmembrane domain: 

amino acids 290-309 

! 

% 

^ N-glycosylation sites. 

yi\ amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 

%\ - 287 

SS5J? 

Q cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

jjj amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 



N-myristoylation site. 

amino acids 116-122 



Amidation site. 

amino acids 488-452 
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FIGURE iso 



CI 



jssjs 



m 



ill 



=44 

AGCCACCAGCGCAACATG&CAGTGAAGACCCT^ 

GTCGATATTGGGGCTTGCCTTTCTGAGTGilGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGAGATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCA?\GGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCA GTAA GAGGTGCATATCC 
ACTCAGCTGAAGAAG 
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FIGURE 160 

MTVKTLHGPAMVKYLLLS ILGLAFLSEAAARKI PKVGHT FFQKPESCPPVPGGSMKLDI GI INEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSYPIQQETLW 
RRKHQGCSVS FQLEKVLVTVGCTCVTPVIHHVQ 

Signal sequence : 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 



ACACTGGCCAAACAAAAACGAAAGCACTCCGTGCTGGAAGTAGGAGGAGAGTCAGGACTCCCAGG 
ACAGAGAGT GCACAAAC TACCCAGCACAGCCCCCTCC GC CCCC TCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCTGAAGATGAGGAAAAGTTTGGAGGAGCAGCTGACTCAGGGGTGGAGGAGCCTAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
M TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
Q GGAACTCAACCAGACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
%\ ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
%f. GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
^| CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
yj CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
111 CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
fj{ GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
f'f TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
r ?:? AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
S GGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
p GTGGC T GT ACT T CACTAC CCAGCAAAGCC T CC ACGAGGGC AGC T CGCC T T GGAGAGTACT T AC TA 

111 CAAGACCT GCAG TCAGGC CAG T G T CT GCAGCTAT GGGAC GAT GACT TGGGAGCGCT AT GGGCCTG 

%\ CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
^ CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
III CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
gj CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
H I CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
? • 6 CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACT TAAA TAAAGGCAGACGCTGTTTTTCTAAAAAAA 
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FIGURE 162 



hrJ 

□ 



I Vi ■ 



r 



CI 
UJ 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCS PGLSCRLWDS DILCLPGDI VPAPGPVLAPT HLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQVVLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSVVYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ 
VNSSEEQjQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKY I H KRWALVWL ACLL FAAAL5LI LLLKKDHAKGWL RLLKQDVRSGAAARG 
BAALLL Y S ADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQG PVAW FHAQ RRQTLQEGGWVLL FS P 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSY FHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence: 

amino acids 1-20 



fit Transmembrane domain. 



amino acids 453-475 
N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site, 

amino acids 583-586 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

. amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



GGGAGGGCTCTGTGCCAGCCCCGATGAGGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCC^GCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACATCCTGACGTGGGACAGCGGGCCAGAGGGCACCCCAGACACGGTCTACAGCATCGAGTATA 
AGACGTACGGAGAGAGGGACTGGGTGGCAftAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GTCAGT GCGGGAGGCCGG TCAGCCACCAA3ATGACT GACAGG T T CAGCTC TC TGCAGC ACACTAC 
CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
T TC TACC ACT TAGAGC TCCAGGTCAACCG CACC TACCAAATGCACC T TGGAGG GAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 
TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
r% CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
pi TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 
TCCACAGCGGCATAGCCTGTCCGAGATCACCTACTTAGGGCAGCCAGACATCTCCATCCTCCAGC 
y| CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
i|| GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 
J-i GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
: fj CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 
H CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 
g TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 
ri CCCTGGGGATT TGCACAGACAGAACATCTGACCCAAATGT GCTACACAGTGGGGAGGAAGGGACA 

CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
;{ ; ; CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
CI TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
UJ GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 
p AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
%] ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 
! 5 * AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 
AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAATGACAGTGCAAGGAGGAAATGCAGGGAAACTCCCGAGGTCCAGAGCCCCACCTC 
CTT^ACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 
GCCTGGAAAAGAACCAGGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
T TC CC AGCC AGGGCAACTGCCT GACGTTGCACGATT T CAGCT TCAT TCCT C TGATAGAACAAAGC 
GAAATGCAGGTCCACCAGGGAGGGAGACACACAAGCCTTTTCTGCAGGCAGGAGTTTCAGACCCT 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 
TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 
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FIGURE 164. 

MRTLLT I LTVGS LAAHAPEDPSDLLQHVKFQS SNFENI LTWDS GPEGTPDTVYS IE YKT YGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYAI^VTAVSAGGRSATKMTDRFS S LQHT TLKPPDVTCI S 
KVRS I QMI VHPTPT P IRAGDGHRLTLEDI FHDL FYHLELQVNRT YQMHLGGKQRE YE FFGLTPDT 
EFLGT IMI CVPTWAKESAPYMCRVKTLPDRTWTYS FSGAFLFSMGFLVAVLCYLS YRYVTKPPAP 
PNS LNVQRVL T FQPLRF I QE HVL I PVFDL SGP S S LAQPVQYS Q I RVS G PRE PAGAP QRHS L S E I T 
YLGQPDI S I LQPSNVPPPQI LSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAI SKVQPSS Y 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175 

N-myr i s toyl a tion sites . 

amino acids 11-16> 47-52, 102-107, 531-536, 565-570 
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FIGURE i6s 

TGGCCTACTGGftAAAAAAA^ 

CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCMAGTGTGCGGGGACGAAGAGTGCAGCAT 
■ GTTftATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTT-TA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGGAGGGGGATCCCTTGAACTTTGGGCTGGA 
AG T G T T G AAC ACAG T T T T G GAT AT T T T CC. AAAAGAT T T GAT C AAGG T AC T T C AT AAAT AC AC G GA 
AGAAGAG C T AC AT AT TCCAGCAGAT GAGACAGAC TTTGTCTGCTTT GAAG GAGGAAGAGAT G A T T 
TTAATAGTTATAATGTAGAAGAGCTTTTAGGATCTTTGGAACTGGAGGACTCTGTACCTGAAGAG 
TCGAAGAAAGCTGAAGAAGTTTCTCAGCACAGAGAGAAATCTCCTGAGGAGTCTCGGGGGCGTGA 
ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
CAGAGAGCACCGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCACCCTCACACCAGCGGT 
CCT GC GGC TAAC GC T C AGGGAGTGCAGT C T TCG T TGGACACT T T TGAAGAAAT T C T G C AC GATAA 
AT T GAAAG T G CCGG GAAG CGAAAGCAGAAC TGGCAAT AGT T C T CC T GCC TCGG T GGAG CG G GAGA 
AGACAGATGC T TACAAAG T C CTGAAAACAGAAATGAGTCAGAGAGGAAGTGGACAGTGCGT T AT T 
CAT T ACAGC AAAGG AT T T C G T T G GC AT CAAAAT CT AAG T T T G T T T T AC AAAG AT T GT T T T TAGT A 

CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
AAAAAAAAAAAAAAAAAAAA 



App_ID=10063581 



Page 3 17 



FIGURE 166 



MAAAPGLLFWLFVLGALWWPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGSVEHS FGYFPKDLIKVLHKYTEEELHI PADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 




N-glycosylation site. 



amino acids 294-298 



cAMP- and cGMP- dependent protein kinase phosphorylation site. 



amino acids 30-34 



Tyrosine kinase phosphorylation site. 



amino acids 67-76 



ill 



N-myristoylation sites. 



amino acids 205-211 f 225-231, 277-283 



Amidation site. 



amino acids 28-32 
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FIGURE 167 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGjGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAAGGCGCACCCTCGAAGTCTTGAA.CTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGG'C 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCG ATGA GCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCTTGAGAATGAAGCAGAACAGAAGTTAATAGAGAGCATGTTGCAAAACCTGACAAAACCCGGG 
u ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
Hi TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
Si ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
% CTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAAGCACAATTATATTTG 
uj CAAGTATGAACCAGAGATTAATCCAACAGCCCCTGTAGAAAAGCCTTATCTTACAAATCAACCAG 
rj5 GAGACACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTCCCAATCTAATTTATGTTGTT 
fli ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
U GCATAAAAGTAAAGGAAGAACAAAAACTAGTCCAAACCAGTCTACACTGTGGATTTCAAAGAGTA 
CCAGAAAAGAAfiuGTGGCATGGAAGT ATAA TAACTCATTGACTTGGTTCCAGAATTTTGTAATTCT 
GGATCTGTATAAGGAATGGCATCAGAACAATAGCTTGGAATGGCTTGAAATCACAAAGGATCTGC 
PI 1 AAGATGAACTGTAAGCTCCCCCTTGAGGCAAATATTATVAGTAATTTTTATATGTCTATTATTTCA 
2| TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA 
uj ACTTCAAACTTCAAGCAAATGAAATGGACAATGCAGATAAAGTTGTTATCAACACGTCGGGAGTA 
□ TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
fjj TGTATATTGTATTGAAATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAA. 

ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG 
AAACTTATTACTGTTGTCAACTGAATTCACACACACACAAA.TATAGTACCATAGAAAAA.GTTTGT 
TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTpTTCAGA 
AAT AAGAAG C TAT T T CAT T AAG T G T G A T AT AAAC C T C C T C AAAC AT T T T AC T T AGAG GCJAAG GAT 

tgtctaatttcaattgtgcaagacatgtgccttataattatttttagcttaaaattaaAcAgatt 
ttgtaataatgtaactttgttaataggtgcataaacactaatgcagtcaatttgaacaaaagaag 
tgacatacacaatataaatcatatgtcttcacacgttgcctatataatgagaagcagc^ctctga 
gggttctgaaatcaatgtggtccctctcttgcccactaaacaaagatggttgttcgggc&ttggg 
attgacactggaggcagatagttgcaaagttagtctaaggtttccctagctgtatttagtctctg 
actatattagtatacaaagaggtcatgtggttgagaccaggtgaatagtcactatcagtgtggag 
acaagcacagcacacagacattttaggaagga^aggaactacgaaatcgtgtgaaaatgggttgg 
aacccatcagtgatcgcatattcattgatgagggtttgcttgagatagaaaatggtggctccttt 
ctgtcttatctcctagtttcttcaatgcttacgccttgttcttctcaagagaaagttgtaactct 
ctggtcttcatatgtccctgtgctccttttaaccaaataaagagttcttgtttctgggggasaaa 
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FIGURE 168 

MSRWSLLLGAALLCGHGAFCRRWSGQKVCFAEiFKHPCYKMAYFHELSSRVSFQEARLAC 

GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 

YRNWYTDEPSCGSEKCWMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 

PYLTNQPGDTHQ3STVWTEAGIIPNLIYWIPTIPLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ 

STLWISKSTRKESGMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 



M Transmembrane domain: 

m 

\Vi amino acids 214-235 



N-glycosylation sites. 

amino acids 86-89, 255-258 



W cAMP- and cQtfP -dependent protein kinase phosphorylation site. 

%l amino acids. 2 66-269 

ill ; 

N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114,- 140- 
145, 212-217 r < : 
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